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MYC03ACTERIAL RECOMBINANTS 
AND PEPTIDES 

Descr ipt ion 

5 Cross Reference to a Related Application 

This is a continuation-in-part of co-pending 
application Serial No. 019,529 filed on February 26, 
1987, that is incorporated herein by reference. 
Technical Field 

10 The present invention relates to recombinant 

proteins and peptides related to mycobacteria, and 
particularly to proteins of Mycobac ter ium 
tuberculosi s that are coded for by adjacent open 
reading frames on complementary DNA strands of the 

15 genome and vectors for propagating and expressing 
those recombinants-, as well as to peptides that 
correspond substantially in sequence to portions of 
those proteins. 
Background Art 

20 The mycobacteria are a diverse collection of 

acid-fast, gram-positive bacteria some of which cause 
important human and animal diseases [reviewed in 
31oom et al., (1983), Rev. Infect. Pis. , ^5: 765-730 ; 
and Chaparas, (1982), CRC Reviews in Microbiology , 

25 _9 :139 " 1971 * In man ' the two most common 

mycobacter ia-caused diseases are tuberculosis and 
leprosy, which result from infections with 
Mycobacter ium tuberculosi s and Mycobacter ium leprae , 
respectively. These two diseases afflict more than 

30 65 million individuals world-wide and result in over 
4 million deaths annually, Bloom et al., (1983), Rev . 
Infect. Pis. , _5 : 765-780 . 

The pathogenicity of these mycobacterial 
infections is closely tied to the host's immune 

35 response to the invading mycobac te r ium [Chaparas, 
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(1982), CRC Reviews in Microbiology , _9:139-197; 
Collins, (1982), Am. Rev . Respir . Pis . , 125 : 42-49 ; 
Dannenberg, (1982), Am. Rev. Respir. Pis. , 125 : 25-29 ; 
and Grange, (1984), Adv . Tuberc . Res . , 2_1 :1 ~ 7 8]. Not 

5 only does tuberculos i s infect and grow within 
cells of the host's immune system, primarily the 
aveolar macrophage, but also it is the host's 
cellular immune response that plays the key roles in 
immunity from infection, containment of the infection 

10 at the initial focus of infection, progression or 
regression of the infection, and tissue damage or 
destruction at the foci of infection [Chaparas, 
(1982), CRC Reviews In Microbiology , _9 : 139-197 ; 
Collins, (1982), Am. Rev. Respir. Pis. , 125 : 4 2-4 9; 

15 Pannenberg, (1932), Am. Rev. Respir. Pis. , 125 :25-29; 
and Grange, (1984), Adv . Tuberc . Res . , 2i :1 ~ 73 1- In 
addition, the standard method of detecting an M . 
tuberculosis infection, the tuberculin skin test, 
actually measures the host's cellular immune response 

20 to the mycobacter ium [Snider, (1982), Am. Rev. 
Respir . Pis . , 125 : 108-118] « The mycobacterial 
components that are important in eliciting the 
cellular immune response are not yet well defined. 

A number of studies have attempted to define 

25 the mycobacterial antigens by standard biochemical 
and immunological techniques including the analysis 
of the target antigens of monoclonal hybridoma 
antibodies directed against mycobacteria [Paniel et 
al., (1978), Microbiol . Rev. , £2:84-113; Engers et 

30 al., (1985), Infect . Immun. , _48: 503-S05; Engers et 
al., (1986), Infect . Immun . , j5l:7l8-720; Grange, 

(1984) , Adv. Tuberc . Res . , 2i :1 " 78 ' Ivanyi et al., 

(1985) , Monoclonal Antibodies Against Bacteria (A. J. 
L. and E. C. Macario, eds.) Academic Press, Inc. New 

35 York. pp. 59-90; and Stanford, (1983), The Biology of 
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the Mycobacteria ' (Ratledge and Stanford, eds.), 
Academic Press, London, vol. 2, pp. 85-127]. 

One particular antigen, a 65 kilodalton (KD) 
protein, is present in a wide range of mycobacterial 

5 species and has been most intensively studied as an 

antigen of M. leprae [Emmrich et al., (1986), J . Sxp. 
Med. , 163:1024-1029; Gillis et al. , (1985), Infect. 
Immun. , 49: 371-377; Young et al . , (1985), Nature, 
31^:450-452; and Mehra et al., (1985) Proc. Natl. 

10 Acad. Sci. USA , 83:7013-7017]. This antigen has been 
designated the 65KD antigen or the cell wall 
protein-a (CWP-a) antigen since it appears to a 
co-purify with cell walls in some isolation 
procedures [Gillis et al., (1985), Infect. Immun. , 

15 !2 : 371-377] . 

In Western blot assays, monoclonal 
antibodies directed against this antigen react with 
' two major components in an leprae extract that 
migrate- with apparent sizes of 55,000 and 65,000 
20 daltons, and react occasionally with smaller 

components as well [Engers et al., (1985), Infect. 
immun. , £8:603-605 and Gillis et al., (1985), Infect. 
Immun . , 37:172-178]. It is not known if these 
species represent discrete proteins or precursors and 
25 products, or result from chemical or enzymatic 

cleavage during isolation. In other species, such as 
M. gordonae , only a single species of about 65,000 
daltons is detected with the monoclonal antibodies 
[Gillis et al., (1985), Infect. Immun. , 49:371-377]. 
30 The 65KD antigen is one of the major 

immunoreactive proteins of the mycobacteria. This 
antigen contains epitopes that are unique to a given 
mycobacterial species as well as epitopes that are 
shared amongst various species of mycobacteria 
35 [Engers et al., (1985), Infect. Immun. , _48: 603-605 
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and Gillis et al., (1985), Infect . Immun. , 
^9:371-377]. In addition, some other antigens that 
appear to be expressed by only one mycobacterial 
species are also found to contain epitopes expressed 

5 in other mycobacterial species. [Kingston et al., 
(1987) infect. Immun . , 55:3149,] 

As discussed hereinafter, it is now found 
that purified 65KD antigen can elicit -a strong 
delayed-type hypersensitivity reaction in 

10 experimental mammals infected with tuberculos is . 
'Antibodies directed against this protein can also be 
detected in the sera of patients with tuberculosis or 
leprosy, and T-cells reactive with this antigen can 
be isolated from patients with leprosy or 

15 tuberculosis as well as from BCG- vacc ina ted persons 

[Emmrich et al., (1<*86) , J. Exp. Med . , 153 : 1024-1029 ; 
Engers et al., (1986), Infect . Immun . , 51 : 713-720 ; 
Mustafa et al., (1986), Nature , 3_1 9 : 63-66 ; and Thole 
et al., (1985), Infect . Immun. , 50; 800-8061 . 

20 Overall, the 65KD antigen appears to be a major, 
medically important B- and T-cell immunogen and 
antigen in humans. 
Brief Summary of the Invention 

The present invention relates to DNA 

25 sequences, vectors containing the DNA sequences, 
proteins, recombinant proteins, peptides, their 
method of manufacture and use that relate to a 
Mycobacterium tuberculosis . More particularly, those 
DNA sequences, vectors, proteins, recombinants and 

30 peptides relate to two proteins denominated the 540 
(65KD) and 517 proteins that are coded for by 
adjacent open reading frames on complementary DNA 
strands of the mycobacterial genome. The peptides 
correspond substantially to portions of those 

35 proteins. 



One embodiment of the invention contemplates 
an isolated DNA molecule that consists essentially of 
a nucleotide sequence, from right to left and in the 
direction from 5 ! -end to 3'-end, corresponding to the 
sequence represented by the formula of Figure 2 from 
about position 3950 to about position 2390 and in a 
consistent reading frame coding for a 517 amino acid 
residue protein of Mycobac ter ium tuberculosis . More 
preferably, that sequence extends from position 3948 
through position 2393. 

A plasmid vector that comprises a replicon 
operationally linked to a foreign DNA sequence such 
as that above and that is capable of replicating that 
foreign DNA sequence in a replication/expression 
medium is also contemplated herein, particularly 
where the replication/expression medium is a 
unicellular organism, such as a bacterium like 
E. coli . The plasmid vector typically includes 
sequence-encoded signals for initiation and 
termination of transcription that are operationally 
linked to the foreign DNA sequence and are compatible 
with the replication/expression medium for 
transcribing a product coded for by the foreign DNA 
sequence. Further, it can include a translation 
initiation codon and a translation termination codon, 
each of which is operationally linked to the 5 f -end 
and the 3 f -end, respectively, of the DNA sequence, 
and are compatible with the replication/expression 
medium for expressing a protein product coded for by 
the foreign DNA sequence. 

Still further, the S'-end of the foreign DNA 
sequence can be operationally linked in tr anslat ional 
reading frame to the 3 1 -end of a second DNA sequence 
that codes for a second protein or protein fragment 
or portion, such as the beta-galactosidase molecule. 
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The protein product expressed by that vector is thus 
a fusion protein that contains the second protein or 
protein fragment or portion at the amino-terminus and 
the first-named protein at the carboxy- terminus of 

5 the fusion protein; i.e., the fragment or portion of 
the second protein is at the amino-terminus of the 
f i rst-named protein, 

A culture comprising bacteria that contain a 
previously described plasmid vector in an aqueous 

10 medium appropriate for the expression of the 517 

amino acid residue protein of M. tuberculosis is also 
contemplated. 

The present invention further contemplates a 
method for producing a 517 amino acid residue protein 

15 of M. tuberculos is . That method comprises the steps 
of culturing a replication/expression medium 
containing a plasmid vector for replioating and 
expressing foreign DNA sequence contained therein. 
That vector contains a foreign DNA sequence that 

20 corresponds substantially to "the previously mentioned 
DNA molecule that encodes the sequence of the 517 
M. tuberculosis protein. The vector also contains 
operatively linked nucleotide sequences regulating 
replication and expression of the foreign DNA 

25 sequence. The culturing is carried out under 

conditions suitable for expression of the protein 
that is encoded by the foreign DNA. The expressed 
protein encoded by that foreign DNA sequence is 
thereafter harvested. Culture is typically carried 

30 out using unicellular organisms as the 

replication/expression medium. Such unicellular 
organism are typically bacteria as described 
previously . 

A method for determining previous 

35 immunological exposure of a mammalian host to 
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Mycobacterium tuberculosis or Mycobacterium bovis is 
also contemplated. This method comprises the 
following steps. An inoculum that consists 
essentially of the purified 65KD (540) protein or an 

5 immunologically active portion thereof coded for by 
the DNA sequence of Figure 2 is administered 
intradermally to an assayed mammalian host. That 
protein is dissolved or dispersed in a 
physiologically tolerable diluent and is present in 

10 that diluent in an amount effective to induce 
erythema and induration in a mammalian host 
previously immunized with M. tuberculosis or 
M. bovis . The mammal is maintained for a time period 
of about 24 to about 72 hours, and thereafter is 

15 assayed for the presence of erythema and induration 
at the site of the intradermal administration at the 
end of that time period. In one aspect of this 
method the purified 65KD protein is obtained from a 
mycobacter ium such as M. tuberculosis . In another 

20 aspect of- this method, the'purified protein is a 
recombinant 65KD protein, or a recombinant fusion 
protein that contains a portion of a 
beta-galac tos idase molecule pept ide-bonded to the 
amino- terminus of the 65KD protein, or to the 

25 amino-terminus of an immunologically active portion 
thereof. This type of assay is usually referred to 
as a delayed cutaneous hypersensitivity (DCH) assay. 

Still another aspect of the invention 
contemplates an inoculum that consists essentially of 

30 the purified 65KD (540 amino acid residue) protein 

antigen or a fusion protein that is coded for by the 
sequence of Figure 2. That protein antigen is 
dissolved or dispersed in a physiologically tolerable 
diluent, and is present in the diluent in an amount 

35 that is effective to induce erythema and induration 
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in a mammalian host previously immunized with 
M. tuberculosis or M. bovis . The 65KD protein 
antigen of the inoculum can be one of the proteins 
useful in the method described immediately above. 

5 Still a further aspect of the invention is a 

peptide that consists essentially of a 5 to about 40 
amino acid residue sequence that corresponds 
substantially to a sequence of the 540 amino acid 
residue protein or the 517 amino acid residue protein 

10 coded for by the DNA protein sequence of Figure 2. 
More preferably, the peptide contains about 10 to 
about 20 amino acid residues. 

Preferred peptides include those having a ' 
sequence, written from left to right in the direction 

15 from amino-terminus to carboxy-terminus using single 
letter symbols, that corresponds to a formula 
selected from the group consisting of 
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wherein each first parenthesized number 
30 refers to the Peptide number of Tables 2 and 4, 

hereinafter, and the second hyphenated numbers refer 
to the position in the sequence of the 540 amino acid 
residue-containing protein whose complete amino acid 
residue sequence and genomic sequence are illustrated 
35 in Figures 2A and 2B. 
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Further contemplated is a method for 
ascertaining the presence of mycobacter ially-exposed 
or mycobacter ially- immune , i.e., previously 
immunologically exposed, mononuclear cells such as T 

5 cells in a body sample. Here, mononuclear cells from 
a mammalian host to be assayed are admixed and 
contacted in an aqueous cell culture medium with a 
stimulating amount of both antigen presenting cells 
and a preferred peptide antigen to form a stimulatory 

10 cell culture. That stimulatory cell culture is 

maintained for a time period sufficient for immune 
mononuclear cells present to be stimulated and to 
evidence their stimulation. The presence of 
mononuclear cell stimulation is thereafter 

15 determined. This assay can be carried out _in vivo as 
a DCH assay where the antigen presenting cells are 
endogenous cells such as macrophages and the aqueous 
medium is supplied by the blood and lymph* The assay 
can also be carried out in vitro . A polymer having 

20 an above peptide as repeating units can also be used 
as the antigen. 

An assay kit containing a preferred peptide 
in a container in an amount sufficient to carry out 
at least one assay as described immediately above is 

25 also contemplated. 

The invention still further contemplates a 

vaccine against mycobacteria such as M. 

n 

tuberculosis . The vaccine comprises a 
physiologically tolerable diluent containing as 

30 immunogen an immunizing effective amount of (i) a 
peptide antigen containing 5 to about 40 residues, 
and more preferably about 10 to about 20 residues, 
whose amino acid residue sequence corresponds 
substantially to a sequence of a mycobacterial 65KD 

35 protein and that is capable of stimulating 
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mycobacter ially- immune T cells having a phenotype 
selected from the group consisting of T4 + and T8 + 
or (ii) a polymer having said peptide antigen as 
repeating units. Preferably, the mycobacteria is M. 

5 tuberculosis . The mycobacteria to which the T cells 
are immune is the same mycobacterial species to which 
the vaccine is directed. 

Yet another aspect of the present invention 
is a polymer that comprises a plurality of 

10 pentapeptide repeating units. Each of those 

pentapeptide repeating units consists essentially of 
a sequence, written from left to right in the 
direction of amino-terminus to carboxy-terminus , 
represented by a formula 

15 

N N N I G; or 
X G N Z G, 

wherein X is an amino acid residue selected 

20 from the group consisting of F, S, T, L, D , and I; 
and Z is an amino acid residue selected from the 
group consisting of T, I, L, S and V. In a further 
aspect of this invention, the pentapeptide repeating 
units are bonded together by peptide bonds, whereas 

25 in yet another aspect, the pentapeptide repeating 
units are bonded together by oxidized cysteine 
residues at the terminii of those repeating units. 
Brief Description of the Drawings 

In the drawings forming a portion of this 

30 disclosure: 

Figure 1 is a schematic restriction map of 
recombinants expressing the M. tuberculosis 65KD 
antigen. The portion of the genome containing the 
65KD protein is shown as the heavy line at the top of 

35 the Figure along with the relative positions (short 
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perpendicular lines abutting the heavy line) of 
restriction endonuclease cleavage sites. The single 
letters adjacent those short lines are indicia of the 
endonclease that cleaves the genome at the indicated 
sites, and are: A = Sad , B = Bgl II, K = Kpnl , M = 
BamHI, P = PstI , R = EcoRI , S = Sal I , V = PvuII, and 
X = Xhol. 

Twenty of the recombinants discussed herein 
are enumerated along the right-hand margin of the 
Figure opposite the schematic line representations of 
the respective genomic portion contained by each 
recombinant. The lengths and positions of those 
genomic portions relative to the genome of the 55KD 
protein are shown by the relative lengths and 
positions of the lines. Dashes at the termini of the 
first six shorter lines indicate that those 
"recombinants contained additional base pairs, but the 
source and sequences of those additional base pairs 
is presently uncertain. 

DNA was isolated from phage stocks of the 
recombinants expressing the 65KD antigen as described 
by Helms et al. (1985) DNA 4^: 39-49, and a restriction 
enzyme cleavage site map was constructed. 

Figure 2 shows the nucleotide sequence of 
the region containing the M. tuberculosis 65KD 
antigen and 517 protein genes, and is provided as 
four sheets labeled 2A, 23, 2C and 2D. The deduced 
amino acid residue seqences of the two long open 
reading frames (ORFs) capable of coding for proteins 
containing 540 and 517 amino acid residues, 
respectively, are shown using the one letter code 
over (540) or under (517) the appropriate triplets. 
Asterisks above or below the respective sequences 
indicate the positions of stop codons (TGA, TAG or 
TAA) in the DNA sequences. Each sequence is shown as 
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beginning with the first methionine (M) residue in 
phase with the ORF and downstream of the nearest 
upstream stop codon. 

Figure 3 is a schematic representation of 
the open reading frames found in the portion of the 
mycobacterial DNA sequence that codes for the 65KD 
antigen. The heavy line near the top of the Figure 
represents a portion of the genome that includes the 
540 and 517 proteins. The shorter, arrow-tipped 
lines beneath the heavy line indicate DNA sequences 
that exceed 120 amino acid residues in length. 
Putative initiation triplets are identified on the 
shorter lines by the letter "M n (AUG) or the letter 
"V" (GUG) at the 5 ' -end of each open reading frame in 
the relatively shorter sequences illustrated beneath 
the heavy line. Arrows indicate the coding 
direction. 

Figure 4 is a photograph of a Western blot 
analysis of products of the 540 amino acid residue 
open reading frame, and contains two panels, A and 
B. Cells were grown and induced (except for lane 2, 
Panel A) and crude extracts were prepared as 
described in the Materials and Methods section, 
hereinafter. For each lane, except lane 5, 200 
micrograms (ug) of protein were electrophoresed on a 
10% Laemmli gel, and transferred to nitrocellulose. 
For lane 5, 500 ug of protein were loaded. The 
immobilized proteins were reacted with the IT-13 
antibodies and visualized, as discussed hereinafter. 

For Panel A, the proteins in the lanes 
were: lane 1, JM83; lane 2, JM83 (pT322) uninduced; 
and lane 3, JM83 (pTB22) induced with IPTG. For 
Panel B, the proteins in the lanes were: lane 1, 
JM83 (pTBl2) ; lane 2, Y1039 ( 7\ SK116) ; lane 3, Y1089 
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(*RY3146); lane 4, BNN97 [E. coli C600 containing 
^ gtlll ; and lane 5, JM83 (pTBl2) . 
DEFINITIONS -* ■ 

The following abbreviations and symbols are 
5 used herein. 



bp - base pair (s) 

kbp - 1000 bp 

10 KD - kilodalton (s) 

M - apparent relative .molecular mass 

DMA - deoxyribonucleic acid 

replicon - the unit that controls 

individual acts of replication; 
15 it has an origin at which 

replication is initiated and it 

can have a terminus at which 

replication stops . 

20 When used in a- context describing or 

depicting nucleotide sequences, the purine or 
pyrimidine bases forming the nucleotide sequence are 
depicted as follows: 

A - deoxyadenyl 
?5 G deoxyguanyl 

C - deoxycytosyl 
T - deoxythymidyl 

In describing a nucleotide sequence each three-letter 
30 triplet constituted by the bases identified above 

represents a trinucleotide of DNA (a codon) having a 
5 f -end on the left and a 3 f -end on the right of the 
upper sequence of Figure 2, and a S'-end on the right 
and a 3 '-end on the left of the lower, complementary 
35 sequence. 
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The word "antigen" is often used in the art 
for an entity that is bound by an antibody. The word 
"immunogen" is often used in the same context for the 
entity that induces the production of antibodies. 
Where the antigen and immunogen are the same entity, 
both are often referred to an antigen. 

The present invention deals with antigens 
and immunogens in the above context, which context 
typically relates to 3 cells and antibodies. 
Notwithstanding the 3 cell/antibody context, the 
present invention also contemplates T cells. 

A more general definition of immunogen and 
antigen apply in the context of T cells and T cell 
stimulation. In that more general definition, an 
"antigen" is an entity acted upon by a component of 
the immune system, and an "immunogen" is an entity 
that initiates an immune system response. Where 
antigen and immunogen are the same, both are referred 
to as an antigen. An "immunologically active" entity 
interacts with antibodies or T cells, or can initiate 
a cellular or humoral immune response. 
Detailed Description of the Invention 
I . OVERVIEW 

In studies discussed hereinafter, the 
isolation of the gene encoding the tuberculos i s 
55KD antigen and the determination of its nucleotide 
sequence are reported. The sequence contains an open 
reading frame encoding 540 amino acid residues or 
about 60,000 daltons, which corresponds to the 65KD 
antigen. A second long open reading frame capable of 
encoding a protein of 517 amino acids was also found 
on the mycobacterial DNA fragment containing the 65KD 
antigen gene, adjacent to that gene. Interestingly, 
the central region of the deduced amino acid residue 
sequence of the 517 amino acid protein contains 
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several tandemly arranged, perfect and imperfect 
repeats of a five amino acid residue sequence. This 
feature is reminiscient of the features of the 
sequence of the major T-cell antigen of the 

5 sporozoite stage of the human malarial parasite 
[Nussenzweig et al., (1985), Cell, 42: 401-403] . 
II. RESULTS 

A. Isolation and Analysis of Recombinants 
Expressing the 65KD Antigen 

10 To isolate the gene that encodes the 65KD 

antigen, monoclonal hybridoma antibodies directed 
against this antigen were used to screen a protein 
expression library constructed with mycobacterial 
DNA . An expression library was chosen since it was 

15 not known a priori if the tuberculosis genes would 

be expressed in coli . Such a recombinant DNA 

library has-been constructed by Young et al., (1985), 

Proc. Natl. Acad. Sci. USA , 82:2583-2587, and 

contains genomic DNA fragments of tuberculosis 

20 inserted into the expression site of the lambda-gtll 
( A gtll) vector. In this system, the inserted coding 
sequences can be expressed as a fusion protein with 

beta-galactosidase . The 65KD antigen-specific 

monoclonal hybridoma antibodies used in these studies 

25 were generated in the laboratories of Dr. T. M. 
Buchanon (Pacific Medical Center, University of 
Washinton, Seattle WA) and Dr. J. Ivanyi (MRC 
Tuberculosis Unit, Hammersmith Hospital, London) and 
were obtained from the Steering Committee on the 

30 Immunology of Tuberculosis of the World Health 
Organization . 

As the initial antibody probe, a pool 
containing three monoclonal antibodies directed 
against the 65KD antigen was used (IT-13, IT-31, and 

35 IT-33) . Thirty-eight positive signals were detected 
in a screen of about 8x105 recombinant phage. 
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The phage corresponding to the positive 
signals were twice plaque purified and then assayed 
for reactivity with the individual antibodies. The 
results of that purification and assay are shown in 
Table l f below. 



TABLE 1 

Patterns of Antibody Reactivities 



Number of Clones 
28 
3 
3 
2 
2 



Reactivity 
IT-13 

+ 

+ 



With Antibodies 
IT-31 IT-33 

+ + 

+ 

+ + 
+ 

+ 



Recombinant clones expressing antigens 
reactive with the 65KD antigen specific monoclonal 
antibodies IT-13, IT-31, and IT-33 were isolated as 
described in the text. For the initial screen, a 
pool of the three antibodies that contained a 1:1000 
dilution of each antibody was used to screen a total 
of about 3x10 recombinant phage from the lambda 
gtll-M. tuberculosis library. To determine which 
monoclonal antibody reacted with which of the 38 
plaque-purified recombinants, about 100 
plaque-forming units (pfu) of each recombinant phage 
were inoculated in small spots on a lawn of E. coli 
Y1090. The phage were allowed to grow, and were 
induced to synthesize the foreign proteins as 
described herein. The filters were then reacted with 
a 1:1000 dilution of one of the monoclonal hybridoma 
antibodies as described in Materials and Methods. 
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Twenty-eight of the recombinants produced 
antigens that reacted with all three antibodies, 
whereas ten recombinants produced antigens that 
reacted with one or two of the antibodies. Overall, 

5 the patterns of reactivity indicate that although the 
three antibodies react with the same mycobacterial 
antigen, each recognizes a different epitope on that 
antigen. Richard A. Young (Whitehead Institute, 
M.I.T.) has also screened this X gtll- M. tuberculosis 

10 library with one of these antibodies (IT-13) and 

detected 10 additional recombinants [Young et al., 
(1985), Proc. Natl. Acad, Sci. USA , 82: 2583-258] . 
These recombinants were not assayed for reactivity 
with the other antibodies. 

15 DNA was isolated from twenty of the 

recombinants expressing the 65KD antigen and a 
restriction enzyme cleavage site map was deduced for 
this region of the mycobacterial genome (Figure 1) . 
In most of the recombinants, the mycobacterial DNA 

20 insert was flanked by EcoRI sites as expected from 
the way in which the library was constructed. 

However, in 6 of the 20 recombinants 
studied, only one of the expected EcoRI sites was 
present. This observation raises the possibility 

25 that a significant fraction of the recombinant phage 
in this library might have arisen from the insertion 
of a fragment containing only one functional EcoRI 
site into the <A gtll EcoRI site or that some clones 
might have undergone some sort of recombination, 

30 rearrangement or deletion event during propagation 
that removed one of the EcoRI sites. 

The deduced restriction map is in good 
agreement with the published map of the gene for the 
M. bovis 65KD antigen [Thole et al., (1985), Infect . 

35 Immun . , _50: 800-306] except for the presence of two 
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additional Smal sites in the M. tuberculosis gene. 
The map does not match well with that of the M. 
leprae 65KD antigen gene [Young et al., (1985), 
Nature , 316 :450-452] . This is not unexpected given 
5 that based on DNA homology studies, M. tuberculosis 
is at least 90% homologous with bovis and only 
about 30% homologous with leprae , Athway et al., 

(1984) , Int. J . Syst. Bacterid. , 21 :371 ~ 375; l^aeda, 

(1985) Int. J, Syst. Bacterid ., 35 : 147-150 . 

10 To determine the nucleotide sequence of this 

region of the mycobacterial genome, several fragments 
from the Agtll recombinants were subcloned into the 
plasmid vector pUC19. The majority of the sequence 
of this region was determined from a subclone (pT37) 

15 of the 1.4 kilobase pair (kbp) EcoRI fragment of 
<VSK7 and a subclone (pTB9) of the 2.5 kbp EcoRI 
fragment of «^RY3143. The sequence across the EcoRI 
site at the junction of these two fragments was 
determined from a fragment isolated from a subclone 

20 (pTBll) of the 2.3 kbp Kpnl fragment of A.SK119. The 
sequence of the region 5' to the 2.5 kbp EcoRI 
fragment was determined from a subclone (pT3l2) of 
the 2.4 kbp Kpnl fragment of (\SK119. 

In all, the nucleotide sequence of 4380 base 

25 pairs of the mycobacterial DNA was determined by a 
combination of the Sanger dideoxy chain termination 
[Sanger et al., (1980), J . Mol. Biol. , 143:161-178] 
and Maxam-Gil?>er t chemical degradation [Maxam et al., 
(1976), Proc. Natl. Acad. Sci. USA , 74: 560-5641 

30 sequencing techniques. The sequence is shown in 
Figure 2. 

As expected for tuberculos is genomic DNA 
[Wayne et al., (1968), J. Bacterid. , 9 5:1916-19191 , 
the base composition of this fragment was about 66% 
35 G+C. The high G+C content increased the chances of 
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sequencing artifacts due to compressions, and made it 
imperative that the sequences were determined for 
both strands in all regions. 

B . Open Reading Frames 

5 The sequence contains five open reading 

frames (ORFs) that begin with an ATG triplet and 
contain greater than 120 amino acids. Two of these 
exceed 200 amino acids in length. One can encode 517 
amino acids and the other 540 amino acids. 

10 There are an additional three open reading 

frames of 140-190 amino acid residues in length that 
do not contain an initiation ATG triplet but do 
contain a GTG triplet. It is not known if a GTG 
triplet can function as a translation initiation 

15 triplet in mycobacteria. The locations of these 

eight open reading frames are shown schematically in 
Figure 3. No portions of the deduced amino acid 
sequences of any of these open reading frames 
displayed any significant homologies with sequences 

20 in the Protein Sequence Database of the Protein 
Identification Resource. 

It should be noted that although an open 
reading frame exceeding 100 amino acids would be 
considered to have a high probability of being. 

25 expressed into protein in most bacteria, this may not 
be true for the mycobacteria. That is, given that 
the G+C content of the insert is about 66%, a 
translation termination triplet (TAA, TAG or TGA) 
would be expected to occur on average about once 

30 every 41 amino acids as compared to about once every 
21 amino acids in a genome with a G+C content of 
50%. Perhaps then, an open reading frame of as many 
as 150-200 amino acids might be due to the random 
distribution of termination triplets rather than 

35 signifying possible biologic importance. As such, 
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only the two very long open reading frames that could 
encode proteins of 517 and 540 amino acid residues, 
respect i vely , are described herein. 

C. The 540 Amino Acid Residue ORF 

5 Corresponds to the 65KD Antigen 

One of the long open reading frames begins 
with an ATG triplet at positions 252-254 of the DNA 
sequence and extends to a TGA triplet at positions 
1872-1874. This ORF encodes 540 amino acids. To 

10 determine if this open reading frame corresponded to 
the gene for the S5KD antigen, the 1511 bp BamHI-Kpnl 
fragment from pT3l2 (residues 438-1948 of the 
sequence represented in Figure 2) , which contains the 
majority of this open reading frame, was inserted 

15 into BamHI-KpnI-cleaved pUC19. In this construct, 
denominated pTB22, the open reading frame is 
expressed using the lac Z transcription and 
translation initiation signals present in the pUC19 
vector, and results in the production of a fusion 

20 protein containing 15 amino acid residues at the 
amino-terminus encoded by the lac Z gene of pUC19 
followed by 478 amino acids of the mycobacterial open 
reading frame. 

Crude extracts were prepared from cells 

25 containing this plasmid, and were tested for 

reactivity with 65KD antigen-specific antibodies in 
Western blot analyses. The reactivity with 
monoclonal antibody IT-13 is shown in panel A of 
Figure 4. In all, five different monoclonal 

30 antibodies specific for the 65KD antigen reacted with 
a species in the crude extract that migrated with an 
apparent relative molecular mass (M ) of about 
55,000 daltons (lane 3). 

No reactivity was seen in extracts of 

35 E. col i lacking the plasmid (lane 1). Furthermore, 
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the expression of this fusion protein is inducible 
with" isopropyl-beta-D-thiogalactopyranoside (IPTG) 
(compare lanes 2 and 3). Therefore, it is concluded 
that this long open reading frame encompassing 
residues 252-1871 encodes the tuberculosis 65KD 
antigen. The phrases "540 amino acid residue 
protein", "540 protein", "65KD protein" and "65KD 
protein antigen" are used interchangeably herein for 
the 65KD protein of M. tuberculosis . 

In addition, the purified recombinant 65KD 
protein was used in Western blot analyses using serum 
from human patients known to be infected with 
M. tuberculosis . In preliminary studies, antisera 
from those patients immunoreac ted with the purified 
recombinant protein . 

Those studies illustrate the use of that 
natural or recombinant protein as an antigen in a 
diagnostic assay method for the presence of naturally 
occurring antibodies to the 65KD protein in the 
infected pat ients , "and thus for the detection of a 
Mycobacterium tuberculosis infection in those 
patients. Similar results are obtained in a more 
usual solid phase assay such as are carried out in a 
microtiter plate where the recombinant 65KD protein 
is affixed to a solid phase matrix to form a solid 
phase support and patient serum is the source of 
antibodies to be assayed. 

Solid phase assays whether carried out in a 
microtiter plate, a dipstick or as a Western blot all 
require the similar steps and constitute variants of 
each other. Each has a solid phase matrix (mirotiter 
plate well, stick surface or nitrocellulose) to which 
the purified natural or a recombinant 540 amino acid 
protein coded for by the genome of M. tuberculosi s as 
antigen is affixed, usually by adsorption, to form 



the solid phase support. The assayed sample such as 
patient serum or cerebrospinal fluid (where evidence 
of tubucular meningitis -is sought to be assayed) in 
liquid form is admixed with the solid phase support 
to form a solid-liquid phase admixture. That 
admixture is maintained under usual biological assay 
conditions (e.g, zero degrees C to about 40 degrees 
C) for a time period sufficient for any antibodies 
present in the assayed sample to immunoreact with and 
bind to the antigen of the solid phase support. The 
solid and liquid phases are separated as by rinsing. 
The presence of antibodies bound to the solid support 
is thereafter determined as with a labeled reagent 
that reacts with the bound human antibodies. 

A labeled reagent that reacts with bound 
human antibodies present is admixed with the solid 
phase to form a second solid-liquid phase admixture. 
That second solid-liquid phase admixture is 
maintained for a time period sufficient for the 
labeled reagent to react with the bound human 
antibodies. The second solid-liquid phase admixture 
is separated as by rinsing, and the amount of label 
present is determined. An amount of label present 
above a background, control value indicates the 
presence of anti-65KD protein antibodies and thus an 
infection by M. tuberculosis . 

The labeled reagent that reacts with the 
bound human antibodies is preferably a labeled 
preparation of xenogenic anti-human antibodies such 
as alkaline phosphatase-con jugated goat anti-human Ig 
antibodies that are available from Tago, 3urlingame, 
CA. The presence of the bound alkaline phosphatase 
is typically determined spec trophotometr ically by 
measurement of the enzymatic hydrolysis of a 
substrate molecule such as £-ni trophenyl phosphate to 



£-ni trophenol . Other enzymes such as horseradish 
peroxidase and other label types such as radioactive 
elements like iodine 125 are also useful. aureus 
protein A linked to a label such as I can also 
react with the bound human antibodies of the 
separated solid phases to detect their presence. 

The above diagnostic assay method is 
typically carried out in a clinical setting using a 
kit. The kit comprises at least one package that 
contains a solid phase support having a purified 540 
protein encoded by the M. tuberculosis genome that is 
from the mycobac ter ium or is a recombinant protein as 
discussed herein affixed as an antigen to a solid 
matrix such as a plastic microtiter plate or 
dipstick. One or more additional reagents such as 
the labeled reagent that reacts with solid 
phase-bound human antibodies, a substrate for the 
labeled reagent (where needed for the label) , buffer 
salts in solution or dry form, and the like can also 
be present in separate packages in the kit. 

D. The 65KD Antigen Gene is 
Expressed in S. coli 

Because previous studies had shown that most 
mycobacterial genes were not expressed in E. col i 
using the mycobacterial transcription and translation 
signal sequences [Clar k-Cur tis et al., (1985) , J . 
Bacteriol, , 161 :1093-1102; and Thole et al., (1985), 
Infect. Immun. , j>0: 800-806] a protein expression 
library was used in the cloning studies. In the 

gtll-M. tuberculosis library, the inserted 
mycobacterial coding sequences should be expressed as 
fusion proteins with beta-galactos idase [Young et 
al., (1983) Proc. Natl. Acad, Sci. USA , 
8^:2583-25871. It was somewhat surprising to find 
that the open reading frame encoding the 65KD antigen 
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did not extend to the 5' -end of the mycobacterial DNA 
insert in ^SK119. This suggested that the 65KD 
antigen was being expressed using the mycobacterial 
transcription and translation signal sequences. 

5 With respect to the previously described 

E . coli consensus signal sequences, the mycobacterial 
sequences 180-230 base pairs upstream of the presumed 
initiator ATG codon do display reasonable matches 
with the consensus sequences for the -35 (3/3 match 

10 with the highly conserved TTG) and -10 (4/6 match 

with TATAAT) regions of coli promoters [Rosenberg 
et al., (1979), Ann. Rev. Genet. , JL3: 319-353 1 . There 
is also a 5/5 match with the Shine-Dalgarno sequence 
[Shine et al., (1974), Proc. Natl. Acad. Sci. USA , 

15 _71: 1342-1346] for a prokaryotic ribosome binding site 
(GGAGG) 13 base pairs upstream of the presumed 
initiator triplet for the 65KD antigen open reading 
frame. Although the precise locations of the 
mycobacterial regulatory sequences have not been 

20 determined experimentally, the results of the two 
studies described below suggest that the 
mycobacterial sequences are indeed functional in 
S . coli . 

The size of the anti-65KD reactive material 
25 produced by the recombinants was determined in a 
Western blot assay. To do this, crude lysates of 
cells expressing recombinant plasmids or phage that 
had been shown to contain the entire 65KD antigen 
gene (?>SK116, pT312) as well as those that had been 
30 shown to contain a large portion of the 65KD antigen 
open reading frame fused to B-galac tos idase 
(<\RY3145? pT322 that contains the 540 protein DNA 
from position 438 through position 1948 of Figure 2) 
were prepared as described in the Materials and 
35 Methods section. 
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The lysates were electrophoreses! on 10% 
Laemmli SDS-polyacrylamide gels, and the separated 
proteins were electrophoret ically transferred to 
nitrocellulose. The SDS-denatured ; immobilized 
proteins were then reacted with monoclonal antibodies 
specific for the 55KD antigen. 

The results using antibody IT-13 are shown 
in Figure 4. In cells expressing recombinants 
carrying the fused open reading frame, the monoclonal 
antibodies detected a single strongly reactive 
species migrating with an of about 160,000 
daltons as well as occasionally detecting smaller 
species (Figure 4, Panel 3, lane 3). In another 
fused open reading frame recombinant, the monoclonal 
antibodies detected a single reactive species 
migrating with an M of about 55,000 daltons 
(Figure 4, Panel A, lane 31. In the extracts of the 
cells expressing recombinants that contained the 
entire 65KD gene, the monoclonal antibodies detected 
a single strongly reactive species that migrated with 
an M f of about 64,000 daltons (Figure 4, Panel 3, 
lanes 1 and 2) . ^ 

Smaller reacting species (about 
40,000-55,000 daltons) were observed when large 
amounts of the extracts were loaded (lane 5) or when 
the protease inhibitor was omitted from the lysis 
buffer. Occasionally, a minor reacting species was 
also observed migrating with an M r of about 67,000 
daltons , 

Given the sizes of the anti-65KD-react ive 
materials, these data indicate that the 65KD antigen 
can be expressed using the mycobacterial translation 
initiation signals present in the 65KD gene. Also, 
since the vector contribution to the recombinant 
plasmids does not contain any known sequences that 
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are properly located and oriented to promote the 
transcription of the inserted DNA , these data suggest 
that the mycobacterial transcription initiation 
signals function in coli to allow the expression 
5 of the 65KD antigen. 

In order to obtain an approximate measure of 
the efficiency of utilization of the mycobacterial 
transcription and translation initiation signals in 
E. coli , two plasmids were constructed that placed 
10 the expression of enzymat ically active 

beta-galactosidase under the control of either the 
mycobacterial signal sequences or the lac gene signal 
sequences present in the plasmid pUC19. 

First, the 3000 bp BamHI fragment from 
15 pMC1871 that contains the coding sequences for amino 
acid residues 8-1021 of beta-galactosidase [Shapira 
et al. r (1983), Gene, 2_5:71-82] was inserted into the 
BamHI site of pT3l2 (residues 437-442 of the sequence 
presented in Figure 2). The resulting 8,1 kbp 
20 plasmid. (pTB27) contains an open reading frame that 
encodes a fusion protein with 63 amino acid residues 
derived from the 65KD antigen gene followed by 1014 
amino acids of beta-galactosidase, and whose 
expression is under the control of the transcription 
25 and -translation signal sequences present in the 
mycobacterial DNA. As expected, this construct 
expresses a protein of about 120,000 daltons that 
reacted with anti-beta-galactos idase antibodies in a 
Western blot assay. 
30 Second, the 3000 bp BamHI fragment from 

pMC187l was inserted into the BamHI site in the 
polylinker of pT39 that contains a 2.4 kbp fragment 
of the 65KD antigen gene inserted in the EcoRI site 
of pUC19. The resulting 8.1 kbp plasmid (pTB28) 
35 contains an open reading frame that encodes a fusion 



^~WO 88/06591 



PCT/US88/00598 



protein with 15 amino acid residues derived from the 
pUC19 lac Z gene and polylinker sequences followed by 
the 1014 amino acid residues of beta-galactos idase 
and whose expression is under the control of the lac 

5 gene signal sequences present in.pUC19. 

Crude extracts of cells containing these 
plasmids were assayed for beta-galactosidase activity 
as previously described. In cells containing pTB27, 
beta-galactosidase activity [about 2800 

10 units/microgr am (ug) protein] was about one-fourth 

that (11,000 units/ug protein) found in IPTG-induced 
cells containing pTB28. Given the unknowns inherent 
in this study (e.g., the specific activities and 
relative stabilities of the two fusion proteins) , one 

15 cannot make a precise quantitative statement about 
the relative strengths of the mycobacterial signal 
sequences and the coli lac gene signal sequences 
based on the relative enzymatic activities found in 
the two cell extracts. However, the data do indicate 

20 that these mycobacterial transcription and 

translation signal sequences are efficiently 
recognized in coli . 

E . The 65KD Antigen Sequence 

Several interesting features of this long 

25 open reading frame have been revealed by a 

computer-aided analysis of the sequence. The overall 
base composition of this open reading frame is 65.5% 
G+C. However, the G+C content varies considerably 
within the codons such that the G+C content of the 

30 bases occupying the first two residues of the codons 
is 55% while it is 87% for the bases found in the 
third position of the codons; thereby producing a 
bias towards using codons that have a G or C in the 
third position. 



35 



\WO 88/06591 



PCT/US88/00598 



-28- 

For example, 50 of the 51 leucine codons 
(CTX) have a G or C in the third position. 
Interestingly, the essentially random occurence of 
any of the four bases in the first two positions of a 

5 codon plus the preference for G or C in the third 
position of a codon is one strategy that allows an 
organism to have a high G+C content without limiting 
access to the amino acids whose codons contain A or T 
residues in the first two positions. 

10 Although the deduced amino acid residue 

sequence of the 55KD antigen is particularly rich in 
alanine, glycine, leucine, and valine residues, the 
overall amino acid residue composition contains 52% 
hydrophobic and 48% hydrophilic residues. 

15 Computer-aided analysis of the alpha helical content 
Chou et al., (1978), Adv. Enzym, , £7:45-148 and 
hyrirophobicity [Hopp et al. , (1981), Proc. Natl. 
Acad. Sci. USA , 7^8 : 3824-3828 ] of the amino acid 
residue sequence revealed numerous regions that could 

20 participate in alpha helical structures and no 

extended regions of high hydrophobicity . These data 
suggest that the 65KD antigen is not an integral 
membrane protein but rather its sequence resembles 
that of a soluble protein. 

25 As discussed before, the 65KD antigen 

appears to be a major T cell immunogen and antigen in 
man. It has been suggested that immunodominant T 
cell epitopes are short stretches of amino acids that 
can form amphiphilic helices where one side of the 

30 helix is hydrophobic and the other side hydrophilic, 
Berzofsky, (1985), Science , 229 : 932-940 . Based on 
computer modeling, seven stretches of amino acids 
within the sequence of the 55KD antigen have been 
identified that could form such amphiphilic helices. 

35 A list of those peptides is shown in Table 2, below. 
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TABLE 2 



Residue 
Pos i t ions" 



Sequence ' 



15 



20 



25 



30 



35 



11-28 (58) 



66-79 (59) 



10 114-130 (60) 



154-172 (61) 



219-233 (23) 
394-408 (52) 
494-50S (63) 



ARRGLERGLNALADAVKV 

EKIGAELVKEVAKK 

GLKRGIEKAVEKVTETL 

QSIGDLIAEAMDKVGNEGV 

LLVSSKVSTVKDLL? 

IEDAVRNAKAAVEEG 

VKVTRSALQNAAS I A 



^■Residue positions are denominated using 
the one letter amino residue sequence of the 55KD 
protein shown in Figure 2 that depicts the methionine 
residue coded for by the triplet beginning at base 
pair position 252 as the first residue of the 
protein. Parenthesized numbers refer to peptide 
numbers that begin with petide number 1 shown in 
Table 4. 

2 These amino acid sequences are shown from 
left to right and in the direction from 
amino-terminus to carboxy- terminus , as is customary 
in the art. 

F. DCH Assay With A Recombinant 

65KD Protein 

Exemplary delayed cutaneous hypersensitivity 
(DCH) assays were carried out using illustrative 
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recombinant proteins described herein as test 
antigens after immunization with M. tuberculos is , M. 
bovis or saline. These assays were carried out 
f olXow i ncj the procedure described in Minden et al. 

5 (1986) Infec. Immun . 53:560-564. 

Briefly, the mammalian hosts were immunized 
with a sufficient amount of M. tuberculos i s or 
M. bovis to induce an immunological response, or with 
a control (saline) . After maintaining the animals 

10 for a time period sufficient for the initial 

immunological response to the immunogen to subside, 
the animals were challenged by intradermal injection 
with inocula containing the 55KD protein, a 
recombinant 65KD protein, or a recombinant fusion 

15 protein that contained the 65KD protein as the test 
antigen dissolved or dispersed in a physiologically 
tolerable diluent, or with a control. The test 
antigens were present in an amount sufficient to 
induce erythema and induration at the site of 

20 administration in a mammal- previously. . immunized with 
M. tuberculosis or M. bovis . 

The results of this study are shown in 
Table 3 , below. 



25 



Table 3 

DCH Assays With Recombinant Antigens 



30 



Challenge 
Antigen ^ 



No. Positive/No. Assayed 

Of Guinea Pigs Immunized With ' 



M. 

tuberculosis 



M. 

bovis 



Saline 



Saline (0) 
35 BNN97 3 (10) 



0/5 
0/5 



0/5 
0/5 



0/5 
0/5 
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/X 1089 4 (10) 5/5 5/5 0/5 

^10S9 4 (1) 5/5 5/5 0/5 

pT322 5 (10) 5/5 5/5 0/5 

pT322 5 (1) 5/5 5/5 0/5 

3CG-S 6 (1) 5/5 5/5 0/5 

PPd 7 (5 T.U.) 5/5 5/5 0/5 



"'"Challenge antigen compositions were 



injected intr adermally as discussed in Materials and 
Methods using amounts of 1 or 10 ug/100 ul per 
injection as indicated by the parenthesized numeral 
15 after each antigen, except for purified protein 
derivative (PPd) that was used in an amount of 5 
tuberculin units (T.U.). 

"The number of guinea pigs exhibiting 
20 positive DCH responses is in the numerator, whereas 
the number of guinea pigs assayed is in the 
denominator. The immunization protocol is described" 
in Materials and Methods. 

25 BNN97 was a crude lysate prepared from 

r, gtll-inf ected E. coli . The crude lysate was 
partially purified by ammonium sulfate precipitation 
as described in the Materials and Methods section. 

30 4 ^1089 was a crude lysate prepared from 

A SKll9-inf ected E. coli that expressed the 65KD 
antigen. The crude lysate was partially purified by 
ammonium sulfate precipitation as described in the 
Materials and Methods section. 

35 
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5 pTB22 was a crude lysate prepared from 
E . coli containing pT322 that expressed the 65KD 
antigen as a fusion protein that contained a portion 
of the beta-galactosidase molecule and about the 
5 carboxy-terminal 38 percent of the 55KD protein. The 
crude lysate was partially purified by ammonium 
sulfate precipitation as described in the Materials 
and Methods section. 

10 ^BCG-S was an extract of M. tuberculosis 

prepared as described in the Materials and Methods 
section . 

7 PPd was obtained from Connaught 
15 Laboratories, Ltd., Willowdale, Ontario, Canada. 

As can be seen from the above results, the 
65KD protein coded for by the DMA sequence of 
Figure 2 can be utilized in DCH as part of a method 

20 to determine whether a mammalian host such as guinea 
pig had previous immunolgical exposure to 
M. tuberculosis since the T leucocytes of the host 
animals produced erythema and induration at the sites 
of administration in the animals previously immunized 

25 with M. tuberculosis and M. bovis , and produced no 
reactions in the saline-immunized animals. Those 
results also show that recombinant 65KD protein 
molecules are similarly useful. Recombinant fusion 
proteins that contain a portion of the 

30 beta-galactosidase molecule pept ide-bonded to the 

amino-terminus of the 55KD protein are also useful, 
as are fusion proteins that contain a portion of the 
beta-galactosidase molecule and an immunologically 
active portion, about the carboxy-terminal 85% of the 

35 65KD protein, e.g., the protein expressed by pT322. 
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Fusion proteins that contain one or more peptide 
sequences as are described in Tables 2 and 4 
hereinafter are also useful. The phrase "previous 
immunological exposure" and its grammatical variants 

5 is used herein to mean that the mammalian host had 

been immunized or infected by one of the mycobacteria 
and the host mammal mounted an immune response 
(primary response) to the immunogens provided by the 
mycobacteria, and that that immune response had 

10 subsided. 

G . The 51~7 Amino Acid Protein 
1 . The Open Reading Frame 
A second long open reading frame begins with 
an A^G codon at positions 3948-3946 of Figure 2 and 
15 extends to a TAA triplet at positions 2397-2395 on 
the DNA strand complementary to the DNA strand 
encoding the 65KD antigen, thereby making those open 
reading frames adjacent in the genome. This open 
reading frame can encode a protein that contains a 
20 sequence of 517 amino acid residues, and that protein 
is referred to herein as the. "517 amino acid protein" 
or the "517 protein". The 517 protein coding region 
thus extends from position 3948 through position 2398 
of Figure 2. 

25 Given that the two long open reading frames 

are located adjacent and downstream from each other 
on the complementary strands, one might expect that 
the transcription of one gene might interfere with 
the transcription of the other unless there were 

30 transcription termination signals within the 

intergenic region. Indeed, there are several short 
sequences (e.g., 2134-2160) within the 520 base pair 
intergenic region that have features reminiscient of 
the transcription termination signals of 

35 gram-negative bacteria [Rosenberg et al., (1979), 
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Ann. Rev. Genet. , 13: 319-3531 . That is, regions 
containing short, G+C-rich, inverted repeats capable 
of forming stem and loop structures followed by a 
stretch of three or more T residues about 20 bases 

5 from the center of dyad symmetry. Perhaps these 
inverted repeats might function as transcription 
termination signals to allow the independent 
expression of each of these mycobacterial genes. 

To determine if the 517 amino acid open 

10 reading frame was expressed into protein in coli , 
extracts of cells containing a plasmid (pTBll) 
carrying the complete open reading frame were probed 
with a polyclonal rabbit antiserum elicited with a 
sonicated extract of tuberculosis bacteria in a 

15 Western blot assay. In these recombinants, the 

putative protein product of the 517 amino acid open 
reading frame would have to be expressed using the 
mycobacterial regulatory sequences. The polyclonal 
antiserum detected more than 100 species in an 

20 extract of tuberculosis cells as well as the 55KD 
antigen in extracts of coli cells carrying the 
appropriate plasmid (pTBl2) , but did not detect any 
novel proteins in extracts of coli cells 
containing plasmids carrying the 517 amino acid 

25 residue protein open reading frame. Hence, either 
this open reading frame is not expressed in coli 
using the mycobacterial regulatory sequences or the 
particular antiserum used in the immunoblots did not 
contain antibodies directed against this protein. 

30 it is not surprising that this open reading 

frame is not expressed in coli using the 
before-discussed recombinant since previous studies 
suggest that most mycobacterial genes are not 
expressed in coli [Clar k-Cur t iss et al., (1985^ , 

35 J . Bacterid. , 161: 1093-1102; and Thole et al . , 
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(1985), Infect, Intmun, , 50-800-8061 . Also, this open 
reading frame does not contain any impressive matches 
to the coli consensus promoter sequences within 
the 400 bases upstream of the ATG triplet although it 

5 does contain a 3/5 match with the Sh ine-Dalgarno 
consensus sequence for ribosome binding sites 12 
bases upstream of the initiator ATG triplet. 
Nonetheless, given the size of this open reading 
frame and its unique structural features (discussed 

10 below) , it most likely is expressed into protein in 

M, tuberculosis and can be expressed in E. coli using 
a recombinant vector designed for that expression, as 
is discussed hereinafter. 

2 . Structural Features of the 51*7 Protein 

15 The second long open reading frame could 

encode 517 amino acids or a protein of about 51,000 
daltons (calculated M. W. =50 , 551) . The deduced amino 
acid residue sequence is rich in alanine, asparagine, 
glycine, and serine and overall is composed of 54% 

20 hydrophobic^ residues and 46% hydrophilic residues. 
The amino' acid sequence of this protein does not 
display significant homologies with any of the 
protein sequences in the Protein Database. 

The most striking features of this sequence 

25 occur between amino acid residues 200 and 350, and 

more particularly at positions 217 through 328. This 
region contains many repeats of short stretches of 
amino acids. 

For example, the five amino acid sequence 

30 a spa r agine- asparagine- a spa r ag ine-i so leucine- 
glycine (N N N I G , using one letter code) is 
repeated three times consecutively at positons 227 
through 241. 

But perhaps the most interesting feature 

35 concerns a five amino residue sequence that displays 
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at least partial matches with several sequences in 
this region. These five residue sequence repeats 
begin at position 217 and continue through position 

*■> q *-» •£ T7» ■! r-» i •» r* o T»Ko ^rtneao<?nc c? p rrn a n r> p O^* f-Kie 

j j; O OX. -L" i y U I U • L lit- ^unOCiiu LiCZi ijCU uv. iiu? wj_ 

5 repeat appears to be X - glycine - asparagine - Z - 
glycine, or XGNZG, using one letter code. For the 
fifteen sequences that match this consensus sequence, 
X is most often phenylalanine , serine or threonine 
(12/15), although X can also be isoleucine, leucine 

10 and aspartic acid. Z is most often isoleucine or 
threonine (10/15), but is also sometimes serine, 
leucine or valine. Additional sequences between 
positions 200 and 350 display partial matches with 
the consensus sequence (i.e., match 2 of the 3 core 

15 residues) . 

The above five residue sequences are 
arranged, from the amino- terminus toward the 
carboxy-terminus , with two abutting (contiguous) 
XGNZG sequences that are contiguous with the three 

20 NNNIG sequences that are themselves contiguous to 
eight contiguous XGNZG sequences. A gap of about 
seventeen residues follows, that is itself followed 
by three contiguous XGNZG consensus sequences. 
Another gap of five residues ensues that abuts 

25 another two contiguous five residue XGNZG consensus 

sequences. Interestingly, both of those gaps contain 
sequences having two of the three core residues of 
the consensus sequence, as well as ^properly spaced X 
and Z residues. 

30 It is further noted that this region 

contains a direct repeat of a fourteen amino acid 
residue sequence with only one mismatch (residues 
295-308 and 315-328) . Those sequences are shown 
below using one letter code: 
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295-308 FNSGSGNIGFGNSG 
315-328 FNSGSGNIGIGNSG. 

As expected, since the amino acid residue 
repeats of the consensus sequences are not exact, the 
nucleotide sequences in this region are not exact 
repeats. This observation suggests that 
recombinat ional processes such as an unequal crossing 
over may not play a role in causing rapid 
evolutionary changes in this region as is often 
observed for highly repeated nucleotide sequences. 

The remainder of this protein sequence does 
not display any other particularly striking features. 

T'he highly repetitious nature of the 517 
residue protein is reminiscent of the repeated 
structures found in the major coat proteins of the 
sporozoite stage of the malaria parasite [Nussenzweig 
et al., (1985) Cell f 4j2 : 401-403] . These 
circumsporozoite or CS proteins are 40-60 KD proteins 
located on the membrane of the infectious sporozoite 
and contain a strongly immunodominant epitope that 
reacts with most of the anti-sporozoi te antibodies 
found in polyclonal antisera as well as all of the 
monoclonal antibodies raised against the sporozoite 
stage. The central region of these proteins contains 
20-40 tandemly arranged repeats of a 11-12 amino acid 
sequence . 

In Plasmodium falciparum , the immunodominant 
epitope is contained within three consecutive repeats 
of the sequence aspar ag ine-alanine-aspar ag ine- 
proline (NANP; which is repeated 37 times in one 
isolate) and antibodies directed against this 
12-residue repeat can provide immunologic protection 
against infection with the malaria parasite. The 
sequence of the repeat differs in the various species 
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of this parasite and the number of repeats can vary 
within different isolates of the same species. The 
similarity of the repeated nature of the CS protein 
and that of the 517 amino acid residue 
M. tuberculosis protein raises the interesting 
possibility that the repeated sequences in the 517 
residue protein might play some role in the immune 
response to mycobacteria. 

3 . Expression of the 517 Protein 
Although the 517 protein was not expressed 
using the before-described recombinant construct, 
that protein was expressed in E. col i using a 
recombinant expression vector designed specifically 
for its expression. That recombinant expression 
vector was constructed as follows, using the base 
pair numbering of Figure 2. It is to be understood 
that the DNA sequence of interest here is that shown 
in the lower of the two DNA sequences depicted, and 
that sequence, is read from right to left and in the 
direction from.5 ! -end to 3'-end, although the 
sequence position numbers are read from left to right 
and in the direction from 5 '-end to 3 1 -end for the 
upper sequence. 

The double stranded DNA sequence of Figure 2 
was cleaved with endonuclease PvuII to provide a 
fragment that extends from position 3511 to position 
4019 (509bp) . That fragment was ligated into the 
Smal site of the pUC19 vector to form intermediate 
I. Two orientations were possible for ligation of 
the PvuII fragment in the vector. Proper orientation 
was determined by usual methods such as isolation of 
several insert-containing clones and preparation of 
restriction maps of the DNA from those clones. For 
example, a Bgll fragment from a clone having the 
PvuII DNA fragment in the proper orientation contains 
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about 1500 bp, whereas a Bgll fragment from a clone 
having an improperly oriented PvuII fragment contains 
only about 1300 bp. Intermediate I was introduced 
into E. coli to propagate the vector DNA. 

5 The propagated DNA of intermediate I was 

thereafter cleaved with endonucleases NotI (position 
3603) and Sail (in the pUC19 polylinker site) . The 
resulting Notl-Sall fragment was discarded, whereas 
the remainder of the DNA of Intermediate I was 

10 retained. 

A further sample of the DNA sequence of 
Figure 2 was cleaved with endonucleases NotI 
(position 3603) and Sail (position 2202) to provide a 
Notl-Sall fragment that was ligated into the 
15 appropriate sites of the retained Intermediate I DNA 
to form a second pUC19-der i ved vector denominated 
Intermediate II. That vector contained the complete 
517 protein DNA sequence, and was propagated further 
in E . coli . 

20 -The propagated .DNA of Intermediate II was 

collected and cleaved with endonucleases EcoRI and 
Hindlll at their respective sites in the 517 protein 
gene and in the polylinker of pUC19. The resulting 
EcoRI-Hind III fragment that contained the 517 

25 protein DNA was thereafter collected and ligated into 
those respective sites in the polylinker of plasmid 
vector pKK223-3 to form Intermediate III that 
contained the carboxy-terminal portion of the gene. 
Intermediate III was cloned in coli JM105. 

30 (pKK223-3 and JM105 are available from Pharmacia Fine 
Chemicals, Piscataway, NJ . ) 

A further sample of the DNA of Intermediate 
II was cleaved with EcoRI alone to excise a portion 
of that DNA from a position in the polylinker to 

35 position 2969 in the 517 protein. The resulting 
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EcoRI fragment containing DNA that codes for the 
amino-terminal portion of the 517 protein was 
collected, and was thereafter ligated "into the single 
EcoRI site of Intermediate III to form the expression 

5 vector that contains the entire 517 protein gene. 

That vector was also cultured in E. coli JM105 as a 
replication/expression medium. 

It is noted that two orientations were 
possible for ligation of the EcoRI fragment in the 

10 expression vector. Proper orientation was determined 
by usual methods such as isolation of several 
insert-containing clones and preparation of 
restriction maps of the DNA from those clones. For 
example, a KpnI-Hindlll fragment from a clone having 

15 the EcoRI DNA fragment in the proper orientation 
contains about 2000 bp, whereas a KpnI-Hindlll 
fragment from a clone having an improperly oriented. 
EcoRI fragment contains only about 800 bp. 

Expression of a recombinant protein from 

20 vector pKK223-3 is inducible with IPTG , and the 

induced recombinant protein is expressed as the'* " 
protein itself, and not as a fusion product. The 
resulting E. coli cells were thus grown and then 
induced with IPTG, as discussed elsewhere herein. 

25 The expressed protein was produced in a 

relatively large amount and could be readily 
identified in an SDS-PAGE gel from a lysate of the 
E. coli cells. The 517 protein had an apparent M r 
of about 55,000 daltons in SDS-PAGE, as expected. 

30 The expressed 517 protein can also be 

collected and purified, as with an affinity column 
made from Sepharose 43 (Pharmacia) to which 
antibodies raised to one or more of the 517 
protein-related peptides are bound via the cyanogen 

35 bromide activation technique, or by ammonium sulfate 
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precipitation , followed by DEAE-cellulose 
chromatography . 

H . Recombinants and Vectors 

The present invention thus contemplates the 

5 purified recombinant 540 protein and 517 protein, as 
well as those recombinant fusion proteins that also 
include all or a portion of another molecule such as 
beta-galactosidase fused to the amino-terminus of 
those proteins. Each of those recombinant proteins 

10 is useful for inducing the production of antibodies 
that immunoreact with those respective molecules as 
obtained from M. tuberculosis itself or from cells 
infected with that mycobac ter ium. Methods of 
preparing such antibodies are well known in the art 

15 and are similar to the methods utilized for the 

peptides of this invention as described hereinafter. 

The purified recombinant 540 amino acid 
residue protein or its fusion proteins when present 
in an effective amount in an inoculum are also useful 

20 in a DCH assay, as described before. Those proteins 
are also useful in diagnostic methods and kits useful 
for assaying for the presence of infection by 
M . tuberculos is . 

Nucleotide sequences are also contemplated, 

25 as are non-chromosomal plasmid vectors useful for 
propagating those DNA sequences and expressing the 
^protein products coded for by those sequences. 

A nucleotide sequence of this invention 
consists essentially of one of the before-described 

30 sequences* Thus, a nucleotide sequence of the 

invention excludes additional nucleotides that affect 
the basic and novel characteristics of a nucleotide 
sequence that codes for the 540 protein or the 517 
protein . 

35 
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A nucleotide sequence of the invention can 
include one or more transcriptional promoter 
sequences operationally linked to the sequence 
adjacent to the 5 f -end thereof. Where translation of 
the DNA and protein expression are desired, the DNA 
also includes a translation initiating codon (ATG) 
and a translation terminating codon (TAA or TAG or 
TGA) , each operationally linked adjacent to the 
5 ! -end and 3'-end, respectively, of the sequence, 
with the translation initiating codon being located 
between the promoter sequence and the 5 '-end. 

A DNA sequence that codes for all or a 
portion of another molecule can also be included in 
the DNA molecule so that the translated (expressed) 
prote inaceous molecule is a fusion protein that 
includes an amino acid residue sequence of all or a 
portion of that other molecule fused (linked by a 
peptide bond) to the expressed 540 protein or 517 
protein. An exemplary fusion polypeptide is the fusion 
protein molecule discussed herein that" contains a 
portion of the be ta-glactos idase molecule fused to the 
amino-terminus of the 540 amino acid "residue protein. 

All of the nucleotide sequences shown in 
Figure 2 can be present so long as an enumerated DNA 
molecule remains replicable, where only replication is 
desired. Where replication and translation 
(proteinaceous molecule expression) are desired, those 
nucleotide sequences are present so long as the DNA 
molecule remains replicable and the proteinaceous 
molecule containing the amino acid residue sequence of 
540 protein or 517 protein expressed exhibits 
immunological cross-reactivity with the antibodies 
raised to an appropriate peptide described herein. In 
more preferred practice, only those base pairs needed 
for expression of a desired protein are utilized. 
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A non-chromosomal, plasmid vector for 
propagation and expression of a desired DNA nucleotide 
sequence as defined herein in a replication/expression 
medium, e.g., a unicellular organism or the like such 
5 as E. coli , j3. cerevisiae or mammalian cells such as 

COS cells, is also contemplated. That vector comprises 
a replicon that is compatible with the 

replication/expression medium and contains therein the 
foreign DNA molecule (e.g., all or a portion of the 
10 sequence shown in Figure 2) to be replicated in a 
manner such that the vector can propagate the DNA 
molecule . 

In addition, the non-chromosomal plasmid 
vector also includes those sequence components that are 

15 utilized for transcription and translation. To that 
end, a transcriptional promoter can be operationally 
linked to the DNA molecule present adjacent to the 
5'-end thereof, as already noted. ^he transcriptional 
promoter can be endogenous to the vector or exogenous 

20 to the vector. A transcriptional promoter endogenous 
to the vector such as the lac Z promoter-operator 
utilized in the vectors derived from pUCl9 or the 
trp-lac ( tac ) promoter of pKK223-3 is preferred. A 
translational terminator can also be operationally 

25 linked adjacent to the 3 f -end of the DNA molecule in 
some instances, although the nucleotide sequence 
represented by the formula of Figure 2 contains such 
terminator sequences . 

An initiation codon (ATG) adjacent to the 

30 5'-end of the sequence that begins translation 

in a replication/expression medium is also required 
to be present in a vector used for expression. Such 
a codon can be present in a defined DNA molecule in 
frame, as is the case with the sequences shown in 

35 Figure 2, or can be a portion of the precursor 
plasmid vector nucleotide sequence. 
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Th e before-discussed transcription promoter, 
translation initiating and translation terminating 
codons are frequently parts of the non-chromosomal 
pi asm id vector as compared to a DNA molecule of the 

5 invention. For use in expression of the 

proteinaceous molecule, the precursor plasmid 
frequently also includes a ribosome binding site 
(Shine-Delgardo sequence) adjacent to the 5'-end of 
the foreign DNA molecule and located upstream from 

10 the initiation codon, as is well known. The vector's 
promoter such as the lacZ and tac promoters utilized 
herein typically contain a ribosome binding site. 

Thus, the nucleotide sequence of the plasmid 
vector used for expression, aside from those 

15 nucleotides needed for the replication and general 

vector function include, in frame and from 5 ! -end to 
3'-end, a ribosome binding site operationally linked 
adjacent to the 5'-end of a transcription promoter; 
that promoter operationally linked to the 5'-end of 

20 the translation initiating codon; that codon 

operationally linked to the 5'-end of: (a) a sequence 
of a portion of another molecule that is expressed as 
a fusion protein with the desired protein, or (b) a 
foreign DNA molecule of this invention; where (b) is 

25 present, that sequence is operationally linked to the 
5-end of a DNA molecule of this invention. An 
expression vector containing the foreign DNA molecule 
of this invention, (however linked adjacent to its 
5 ! -end) also contains a translation terminating codon 

30 adjacent the 3 f -end of the foreign DNA. 

It is to be understood that all of the DNA 
sequences of the vector must be compatible with the 
replication/expression medium utilized for 
replicating the DNA, and more preferably for 

35 expressing a product coded for (encoded by) a DNA 
molecule of this invention. 
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It is also to be understood that the 
before-recited signal sequences of the useful vector 
can "be supplied to that vector by the foreign DNA or 
by a precursor to the final vector. For example , the 
5 translation initiation and termination codons in the 
expression vector for the 517 protein are provided by 
the foreign DNA, whereas the promoter and ribosomal 
binding site sequences are provided by the precursor 
plasmid . 

10 A vector of the invention is at least 

capable of replicating (propagating) a DNA molecule 
of the invention. More preferably, the vector is 
capable of not only replicating a DNA molecule, but 
is also capable of expressing or translating the 

15 genomic information of that DNA into a recombinant 
protein molecule that is immunologically similar to 
the 540 protein or the 517 protein; i.e., will induce 
cross- re active ant i bod ies . 

A non-chromosomal plasmid vector of this 

20 invention need not be limited to those vectors useful 
for replication and translation (expression) in 
E . coli as host replication/expression medium. 
Substantially any vector useful for replicating 
(propagating) and expressing a DNA sequence can be 

25 utilized for replicating the DNA, e.g. in mammalian 
or eukaryotic cells. 

A wide range of such vectors is commercially 
available as are appropriate host replication media. 
Exemplary vectors, both plasmids and bacteriophages 

30 and hosts are available from the American Type 

Culture Collection of Rockville, MD, and are listed 
in its CATALOGUE OF BACTERIA, PHAGES AND rDNA 
VECTORS, sixteenth ed . , 1985 . In addition, plasmids, 
cosmids and cloning vectors are listed as being 

35 available in catalogues from Boehringer Mannheim 

Biochemicals of Indianapolis, IN; Bethesda Research 
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Laboratories, Inc. of Gaethersberg , MD, and New 
England Biolabs, Inc. of Beverly, MA. 
I . Peptides 

Another aspect of the present invention 

5 relates to a peptide that consists essentially of an 
amino acid residue sequence that corresponds 
substantially to a portion of the 540 or the 517 
protein sequence. Such a peptide contains 5 to about 
40 amino acid residues, and more preferably about 10 

10 to about 20 amino acid residues that correspond 

substantially in sequence to a protein of either the 
540 amino acid residue protein or the 517 amino acid 
residue protein that are coded for by the DNA 
sequence shown in Figure 2. 

15 A useful peptide most preferably contains 

only those amino acid residues that are identical or 
homologous to (conservative substitutions for) 
residues present in a sequence of either of the two 
above proteins. Additional residues of substantially 

20 any length can also be present at either or both 
termini of the peptide. However, any additional 
residues must not interfere with the activity of the 
peptide, as discussed hereinafter, and therefore, a 
peptide of this invention is said to "consist 

25 essentially" of an enumerated sequence. For example, 
a peptide of the invention is free of 
immunosuppressing sequences. In addition, if 
additional residues are present, and together with an 
above peptide correspond substantially in sequence to 

30 further portions of the same protein to which the 
sequence of the peptide substantially corresponds, 
the resulting peptide is of a molecular weight less 
than that of the naturally occurring 540 or 517 
prote ins , respectively . 

35 A peptide of this invention is useful, inter 

alia , for inducing the production of antibodies in a 



^IVO 88/06591 



PCT/US88/00598 



-47- 

laboratory mammal such as a mouse or rabbit. Those 
induced antibodies immunoreact with the inducing 
peptide as well as with the protein to which the 
peptide sequence substantially corresponds when that 

5 protein is in an SDS-denatured form as in a Western 
blot analysis subsequent to SDS-PAGE analysis. 

Thus, the anti-peptide antibodies can be 
used in solid phase assays for the detection of the 
presence of an antigen that is the 540 protein or the 

10 517 protein of M. tuberculosis . In this instance, 
the assayed sample such as sputum provides the 
antigen that is affixed to the solid phase matrix to 
form the solid support. An aqueous composition 
containing the anti-peptide antibodies or their 

15 idiotypic portions (binding site-containing portions) 
is admixed, maintained and separated from the solid 
phase -as previously discussed for the presence of 
anti-65KD protein antibodies. The presence of bound 
anti-peptide antibodies is thereafter assayed to 

20 determine the presence of- the M. tuberculosis antigen 
in the sample, following the broad admixture, 
separation and analysis steps previously described. 
Whole antibodies and their idiotype-containing 
portions such as Fab and F(ab ! )^ portions are 

25 collectively referred to as paratoic molecules. 

In exemplary studies, antibodies (paratopic 
molecules) were raised in New Zealand white rabbits 
to both the .amino-terminal and carboxy-terminal 
polypeptide sequences (Peptides 1 and 54 of Table 4, 

30 hereinafter) of the 540 protein* Varying dilutions 
of pure M. tuberculosis cultures were bound to the 
walls of microtiter plates to form a solid support 
and one or the other of the two aqueous anti-peptide 
antibody preparations was admixed with the solid 

35 support to form a solid/liquid phase admixture. 

After maintaining the solid/liquid phase admixture 
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for a time period sufficient for the anti-peptide 
antibodies to bind to the mycobacterial antigens 
present, the phases were separated. The solid phase 
was rinsed to assure removal of unbound anti-peptide 

5 antibodies. The presence of anti-peptide antibodies 
bound to the solid support was thereafter determined 
by standard methods. 

As a result of those studies, it was 
determined that the presence of a mycobacterial 

10 antigen could be detected at a concentration of about 
10^ organisms per well. Sputum samples from 
persons with active infections of M. tuberculosis 
typically contain about 10^-10^ organisms in the 
volume of a sample utilized in the study. Thus, 

15 anti-peptide antibodies raised to a peptide of this 
invention such as those raised to Peptides 1 and 54 
can be utilized to detect mycobacterial antigens 
present at a level found in a clinical environment. 

Antibodies were similarly raised to the 

20 immunologically active recombinant, fusion, 540 

protein produced by pTB22, and a -similar antibody 
binding study was carried out. The results were 
generally similar to those discussed above, except 
that this assay was somewhat more sensitive, 

25 presumably as a result of the polyclonal character of 
the induced antibodies. 

In addition to the above assays for 
mycobacterial antigens, several additional 
immunoassays can be carried out using antibodies 

30 induced (a) by the previously-mentioned 540 protein, 
or more particularly (b) by an immunologically active 
portion thereof such as the fusion protein produced 
by pTB22, a fusion protein that contains a peptide 
sequence of Tables 2 or 4 fused to a portion or all 

35 of another molecule such as beta-galactos idase , or by 
a peptide of Tables 2 and 4. Such additional 



immunoassays are well known in the art and include, 
for example, double antibody, "sandwich", assays, and 
competition assays as where a peptide or other 
antigen described herein competes for the antibodies 
with a mycobacterial antigen in the assayed sample. 

In each of the immunoassays, a sample to be 
assayed for the presence of a mycobacterial 65KD cell 
wall protein-a antigen is admixed in an aqueous 
medium with paratopic molecules raised to the 540 
protein, or more particularly to an immunologically 
active portion thereof. The resulting admixture is 
maintained for a time period sufficient for the 
paratopic molecules to immunoreact with mycobacterial 
antigens present in the admixed sample to form an 
immunoreactant . ^he presence, and usually the 
amount, of immunoreactant formed is determined. 

The anti-peptide -paratopic molecules can 
themselves contain a label. Preferably, however, a 
second label-containing reagent is utilized that 
reacts with the bound paratopic molecules such as 
whole anti-peptide antibodies. The 
peroxidase-con jug a ted goat-ant i -mouse antibodies 
utilized herein are exemplary of such reagents. 

A solid phase assay kit that utilizes the 
anti-peptide antibodies or other paratopic molecules 
induced by an immunologically active portion of the 
540 protein is also contemplated herein for clinical 
use of the before-described method. Here, the kit 
contains at least a solid phase matrix to which the 
assayed-for antigen of the sample or antibodies can 
be affixed in one package and a preparation of 
anti-peptide or ant i-immunolog ically active 540 
protein portion paratopic molecules that immunoreact 
with the 540 (55KD) protein or the 517 protein in a 
second package. Additional packages of reagents 
similar in type and function to those previously 
mentioned can also be included. 
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For inducing paratopic molecules such as 
whole antibodies, a useful peptide is typically 
linked to an antigenic carrier molecule such as 
keyhole limpet hemocyanin (KLH) as a conjugate, the 
conjugate is thereafter dispersed in a 
physiologically tolerable diluent as an inoculum, and 
the inoculum is injected into the laboratory mammal 
using well known procedures. The inoculated animal 
is maintained and given booster injections as 
required, until a desired antibody titer to the 
inducing peptide is achieved. The mammal's 
antibody-containing serum is thereafter obtained, 
purified as desired, and utilized in a diagnostic 
assay such as an SDS-PAGE/Wes tern blot for the 
presence of a substantially corresponding protein. 

The word "inoculum" in its various 
grammatical forms is used herein to describe a. 
composition containing an amount of peptide 
conjugate, peptide polymer (as described 
hereinafter), 65KD protein' or ^recombinant protein 
sufficient for a described purpose that is dissolved 
or dispersed in an aqueous, physiologically tolerable 
diluent. Exemplary diluents are well known and 
include water, physiological saline, 
phosphate-buffered saline, Ringer's solution, 
incomplete Freund's adjuvant and the like. 

Inocula can contain varying amounts of a 
preferred peptide or polymer, depending upon its use. 

Where paratopic molecules are to be formed 
or an inoculum is otherwise to be used as a vaccine, 
about 100-500 micrograms of peptide or peptide 
polymer are used per injection into laboratory 
animals such as mice, rabbits or guinea pigs. Larger 
amounts are utilized for larger mammals, as is 
known. Similar amounts of peptide or polymer are 
utilized for _in vivo DCH assays. 
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Smaller amounts of antigenic peptide or 

antigenic peptide polymer are utilized for rn vitro 

stimulation assays. Using 200 microliters (ul) of 

4 

total volume and about 1-2x10 PBMC and about 

5 IxlO 5 antigen-presenting cells, concentrations of 
about 0.1 to about 50 micrograms of antigen per 
milliliter are useful. 

Exemplary procedures for the chemical 
synthesis of a useful peptide as well as preparation 

10 of a conjugate and use of the conjugate to raise 

antibodies can be found in U.S. Patent No. 4,636,453, 
No. 4,599,231, No, 4,599,230, No. 4,545,931, 
No. 4,544,500, all of whose disclosures are 
incorporated herein by reference. 

15 Another use for a preferred peptide of this 

invention is in an assay for the presence of 
mycobacter i ally-exposed (or immune); i.e., previously 
immunologically exposed, mononuclear cells such as T 
cells in a body sample containing such cells. 

20 Mycobacter i ally-exposed (or immune) mononuclear cells 
are cells that themselves have been immunologically 
exposed to a mycobacterial immunogen or whose 
progenitor cells had been so exposed to such an 
immunogen. Thus, a preferred peptide can be used to 

25 determine whether a mammal has been immunized against 
a mycobacter i um or has or has had a mycobacterial 
i nf ection . 

In such an assay, peripheral blood 
mononuclear cells, and particularly T cells, from the 

30 mammal are provided. Those cells are admixed and 
contacted in an aqueous cell culture medium with a 
stimulating amount of both antigen presenting cells 
and a preferred peptide of the invention to form a 
stimulatory cell culture. The stimulatory cell 

35 culture is maintained for a period of time sufficient 
for immune mononuclear cells present to be stimulated 
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and to evidence that stimulation, usually about 13-96 
hours and most usually 24-43 hours, under usual cell 
culture conditions. The presence of mononuclear cell 
stimulation is thereafter determined. 

Where mononuclear cell stimulation is found, 
it indicates the presence in the assayed mononuclear 
cell population of cells which themselves were 
immunologically exposed to a mycobacter ium or whose 
parental line was immunologically exposed to a 
mycobacter ium . 

As is illustrated by the results shown in 
Table 3, hereinbefore, the recombinant 540 protein 
and recombinant fusion protein containing a portion 
of the bet a-galactos idase molecule and an 
immunologically active portion of the 540 protein 
were useful in stimulating mycobac ter i ologi cally- 
immune mononuclear cells i_n vi vo in a DCH assay. 
Such molecules can be utilized in the above-described 
assay, and in the stimulatory assays described 
hereinafter in the same manner as can the peptides, 
and in the place of a peptide. 

Mononuclear cell stimulation can be 
determined in a number of manners that are well known 
in the art, some of which are described specifically 
hereinafter. The cells of the mononuclear cell 
population that most usually are stimulated are T 
cells, and for that reason, the mononuclear cells 
will be usually referred to hereinafter as T cells. 
More particularly, T cells that exhibit the CD4 or T4 
(CD4 + or T4 + antigen and those that exhibit the 
CD8 or T3 (CD8 + or T8 + ) antigen are the cells 
that are typically stimulated. Those T cells are 
often more generally referred to as helper and killer 
or cytotoxic T cells, respectively. 

Exemplary manners in which T cell 
stimulation can be determined include (a) 
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prolif er ation as assayed by the uptake of a 
radiolabeled nucleoside such as [ ] -deoxy thymidine 
also referred to as [^H]-Tdr, [ 3 H] -thymidine 
([ 3 H]-T), (b) secretion of inter f eron-gamma , (c) 

5 secretion of i nterleuki n- 2 (IL-2), (d) secretion of 
granulocyte macrophage-colony stimulating factor 
(GM-CSF) , (e) cytotoxicity, a phenomenon that can 
occur with T cells such as T4 + T cells as well as 
with T8 + cells, (f) the ability to provide an ^in 

10 vitro B cell helper function, (g) the ability of 

immune T cell clones to provide a delayed cutaneous 
hypersensitivity (DCH) response ir\ vivo as described 
herein and in U.S. Patent No. 4,639,397 whose 
disclosures are incorporated by reference, and 

15 (h) the ability of immune T cell clones to provide 
protective immunity irx vi vo . 

A kit is also contemplated for use with the 
immediately preceding assay. That kit can include a 
number of containers, at least one which contains a 

20 preferred peptide antigen of this invention or a 
polymer of such a peptide antigen whose repeating 
units are comprised of a "di-Cys-terminated" peptide 
as is described hereinafter. A mixture of two or 
more such preferred peptides or their polymers can 

25 also be present. A sufficient amount of a preferred 
peptide or peptide polymer is contained in the 
container to perform at least one assay using that 
method . 

The assay kit can further include a 
30 premeasured amount of buffer or other salt for the 

preparation of an inoculum of the peptide or polymer 
upon the addition of water or other suitable aqueous 
medium. The inoculum can also be provided in 
premixed aqueous form either at the concentration for 
35 use or as an aqueous concentrate to be diluted. 



Of course, the particular constituents and 
concentrations of those constituents can differ 
between rn vitro and in vivo assays as- well as 
between different mammals whose cells are to be 
assayed. Such constituents and concentrations can be 
readily determined by skilled workers. It is to be 
further understood that a previously described fusion 
protein that includes an immunologically active 
portion of a mycobacterial 65KD cell wall protein-a 
antigen can also be used to the exclusion of a 
peptide or polymer thereof as the antigen. Thus, the 
antigen of the kit can more broadly be described as a 
mycobacter ial antigen. 

A useful peptide corresponds substantially 
in sequence to a sequence of either the 540 or the 
517 proteins discussed previously. Substantial 
correspondence of peptide sequences can be determined 
in a number of ways. 

Of course, two peptides having identical 
sequences correspond substantially, as do to peptides 
that share identical sequences but also contain one 
or more further sequences. Similarly, two sequences 
that differ by conservative substitutions such as 
isoleucine for leucine or valine, asparatic acid for 
glutamic acid, asparagine for glutamine, arginine for 
lysine, serine for threonine, phenylalanine for 
tryptophan and tyrosine for phenylalanine, also 
correspond substantially. 

Two sequences can also correspond 
substantially when antibodies raised to one 
immunoreact with another. For example, the 
particular peptides disclosed hereinafter can be used 
to raise antibodies that immunoreact with the 65KD 
(540) protein, and consequently, those peptides 
correspond substantially in sequence to the sequence 
of the 65KD protein. 
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Biochemical evidence from immunoassay and 
from analogy with conserved protein-protein 
interaction in solved X-ray crystallogr aphic 
structures with differing sequences such as in the 
dimer contacts of oligomeric enzymes indicates that 
the conservation of protein-protein recognition does 
not require a strict conservation of sequence, for 
relatedness. Whereas single amino acid residue 
changes may affect such recognition to a wide degree 
depending upon the nature of the subs ti tution , in 
general terms the relatedness and thus substantial 
correspondence of two differing amino acid sequences 
with respect to protein-protein (and antigenic and/or 
immunogenic) recognition can be expressed in terms of 
seven basic amino acid residue parameters: 

(1 ) hydr ophobici ty ; 

(2) evolutionary occurrence of changes in 
known sequences; 

(3) size of side chain; 

(4) charge and polarity; 

(5) preference for turned secondary 
structure ; 

(6) preference for beta strand secondary 
structure; and 

(7) preference for helical secondary 
structure • 

To define the degree of sequence identity 
relevant to antigenic and/or immunogenic recognition, 
and thus substantial correspondence of peptide 
variants, a consensus matrix based upon the above 
seven criteria can also be used to assign numerical 
values for each amino acid residue pair in the 
sequences being considered for substantial 
correspondence. For the purposes of the present 
invention, a consensus matrix developed by Dr. 
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Elizabeth Getzoff and Dr. John Tainer of the Scripps 
Clinic and Research Foundation of La Jolla, CA can be 
used* That consensus matrix is as follows, wherein 
the individual amino acid residues are designated by 
5 a one-letter code in the interests of conciseness: 




30 



Sequence comparison using the foregoing 
consensus matrix involves the determination of all 
possible alignments and the subsequent scoring of 
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For example, to ascertain the substantial 
correspondence of the amino acid residue sequences 
-Lys-Tr p-Phe-Cys-Gly- 

an A 
* * w 

-Arg-Ile-Phe-Cys-Gly- 

the consensus matrix yields the following values 



Value 



-Lys- 


& 


-Ar g- 


or 


K 


& 


R 


5 


-Trp- 


& 


-Ile- 


or 


W 


Sc 


I 


0 


-Phe- 


& 


-Phe- 


or 


F 


Sc 


F 


7 


-Cys- 


& 


-Cys- 


or 


C 


Sc 


C 


7 


-Gly- 


& 


-Gly- 


or 


G 


Sc 


G 


3 



Total 27 

For substantial correspondence at the 99.7% 
confidence level, the consensus matrix score must 
exceed the number of amino acid residue pairs under 
consideration times 3; i.e., 5x3 or 15. Inasmuch as 
27 is greater than 15, substantial correspondence is 
indeed present for the above two peptide sequences. 

For the purposes of the present invention, 
substantial correspondence among peptides within the 
scope of the invention preferably is present at least 
to about 95% confidence level, and more preferably to 
at least about 99% confidence level. 

A DNA sequence can correspond substantially 
to another DNA sequence if both sequences contain 
sequences of fifteen bases that are in phase and 
identical, or bases that are not identical but code 
for an identical sequence of amino acid residues, or 
code for amino acid residue sequences that correspond 
substantially. Thus, amino acid residue sequences 
that correspond substantially are encoded by DNA 
sequences that correspond substantially. 
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It is to be noted that two or more peptide 
sequences can substantially correspond as determined 
by one or both of the latter two definitions and 
still exhibit different immunor eacti vi ti es with 
antibodies raised to the intact protein as is found 
in nature or with T cells stimulated by such natural 
proteins. An example of this phenomenon is discussed 
her einaf ter . 

In addition to the specific peptides 
disclosed in Table 2, hereinbefore, further peptides 
that correspond in sequence to a portion of the 540 
protein sequence are also useful herein. A list of 
those peptides is provided in Table 4, below. 



15 



Table 4 



Peptides 



20 Peptide 



Number Residues" 



Sequence ' 



1-15 



MAKTIAYDEEARRGL 



25 



30 



5 
6 



35 



11-25 
21-35 
31-45 
41-55 
51-65 
61-75 



ARRGLERGLNALADA 



ALADAVKVTLGPKGR 



GPKGRNVVLEKKWGA 



KKWGAPTITNDGVSI 



DGVS IAKEIELEDPY 



LEDPYEKIGAELVKE 
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8 71-85 ELVKEVAKKTDDVAG 

9 81-95 DDVAGDGTTTATVLA 
5 10 91-105 ATVLAQALVREGLRN 

H 101-115 EGLRNVAAGANPLGL 

12 111-125 NPLGLKRGIEKAVEK 

10 

13 121-135 KAVEKVTETLLKGAK 

14 131-145 LKGAKEVETKEQIAA 
15 15 141-155 EQIAATAAISAGDQS 

16 151-165 AGDQSIGDL I A E A M D 

17 161-175 AEAMDKVGNEGVI TV 

20 

18 171-185 GVITVEESNTF. GLQL 

19 131-195 FGLQLELTEGMRFDK 
25 20 191-205 MRFDKGYISGYFVTD 

21 201-215 YFVTDPERQEAVLED 

22 211-225 AVLEDPYILLVSSKV 

30 

23 219-233 LLVSSKVSTVKDLLP 

24 231-245 LLPLLEKVIGAGKPL 
35 25 241-255 AGKPLLI IAEDVEGE 



^»WO 88/06591 



PCT/US88/00598 



10 



26 
27 
28 
29 
30 
31 
32 

15 33 
34 
35 

20 

36 
37 

25 38 
39 



30 



40 
41 
42 

35 43 



251-265 

261-275 

271-285 

281-295 

291-305 

301-315 

311-325 

321-335 

331-345 

341-355 

351-365 

361-375 

371-385 

381-395 

391-405 

401-415 

411-425 

421-435 
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DVEGEALSTLVVNKI 

VVNKI RGTFKSVAVK 

SVAVKAPGFGDRRKA 

DRRKAMLQDMAILTG 

AILTGGQVISEEVGL 

EEVGLTLENADLSLL 

DLSLLGKARKVVVTK 

VVVTKDETTIVEGAG 

VEGAGDTDAIAGRVA 

AGRVAQI RQEIENSD 

IENSDSDYDREKLQE 

EKLQERLAKLAGGVA 

AGGVAVIKAGAATEV 

AATEVELKERKHRIE 

KHRIEDAVRNAKAAV 

AKAAVEEG I VAGGGV 

AGGGVTLLQAAPTLD 

APTLDELKLEGDEAT 
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45 



431-445 



441-455 
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GDEATGANIVKVALE 
KVALEAPLKQIAFNS 



46 



451-465 



IAFNSGLEPGVVAEK 



10 



47 



48 



49 



461-475 
471-485 
481-495 



VVAEKVRNLPAGHGL 



AGHGLNAQTGVYEDL 



VYEDLLAAGVADPVK 



50 



491-505 



ADPVKVTRSALQNAA 



15 51 



501-515 



LQNAAS IAGLFLTTE 



52 



511-525 



FLTTEAVVADKPEKE 



20 



53 
54 



521-535 
526-540 



1,2 



KPEKEKASVPGGGDM 



KASVPGGGDMGGMDF 



See Notes 1 and 2 of Table 2. 



25 Preferred peptides for use in the previously 

described assay for the presence of raycobacteri ally- 
immune mononuclear cells are those that are numbered 
as follows, wherein the numbers are those shown in 
one or both of Tables 2 and 4, and whose sequence 

30 positions in the 540 protein are given in 

parentheses: Peptide 22 (211-225); Peptide 23 
(219-233); Peptide 24 (231-245); Peptide 30 
(291-305); Peptide 46 (451-465); Peptide 58 (11-28); 
Peptide 59 (66-79); Peptide 60 (114-130); and Peptide 

35 62 (394-408) . 
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Several proliferative assays were conducted 
using the peptides of the invention. Results of 
those studied are shown and discussed below. 

One study was carried out using pooled 
peripheral blood mononuclear cells (PBMC) from 
M. bovi s BCG - vaccina ted humans. The details^of this 
study are described in the Materials and Methods 
Section. Briefly, PBMC were isolated and seeded into 
culture plate wells. Such PBMC populations contain 
their own endogenous antigen-presenting or feeder 
cells. A peptide of the invention was added as 
antigen at either 0.1 microgram per milliliter 
(ug/ml), 1 ug/ml or 10 ug/ml of culture. The 
antigen/cell culture mixture was maintained for a 
time period of six days, at which time, radiolabeled 
thymidine was admixed. The cultures were harvested 
eighteen hours later and the thymidine t incorporation 
was measured in a liquid scintillation counter. The 
results of this study are shown in Table 5, below. 

Table 5 

Protein 540 
Peptide- Induced PBMC Prol if er at ion 1 



Peptide Residue Proliferation 

Number ^ Posi tions ^ Response ^ 

10 91-105 

21 201-215 

22 211-225 ++ 

24 231-245 ++ 

25 241-255 

30 291-305 ++ 

35 341-355 
43 421-435 
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46 451-465 +++ 

47 461-475 

48 471-485 

49 481-495 
5 53 521-535 

54 526-540 

58* 11- 28 ++ 

59* 66- 79 ++ 

60* 114-130 +++ 

10 61* 154-172 

23* 219-233 +++ 

62* 394-408 +++ 

63* 494-508 



15 Proliferation as measured by 



incorporation of [ H] - thymidine in counts per 
minute (cpm) . 

Peptide number as shown in Tables 2 and 4. 
^ Peptide residue sequence positions as shown 
20 in Tables 2 and 4 and in Figure 2. 

4 Proliferative response reported at the 
optimal peptide concentration is represented as follows: 
"+++" = 10,000-40,000 cpm; "++" 2000-10,000 cpm; or — = 
300-700 cpm. Proliferation in the absence of peptide 
25 antigen was 421+37 cpm, and was 82,857 +2,913 cpm in the 
presence of an extract of M. tuberculosis . Standard 
deviations did not exceed 15 percent in any of the 
triplicate measurements. 

Peptides predicted to form amphiphilic 

30 helices . 

The above results indicate that nine of the 
twenty petides assayed elicited a strong 
proliferative response. Thus, nine regions of the 
35 540 protein were identified as human T cell antigens. 
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Seven regions of. the 540 protein were 
predicted by computer-assisted analysis to form 
amphiphilic helices. Regions that can form 
amphophilic helices appear to have a higher 

5 probability of being recognized by T cells. 

Berzofsky, (1985) Science , 229:932-940. However, 
only five of those seven peptides provided a 
proliferative response. This indicates that 
amphiphilici ty is neither sufficient nor necessary 

10 for a peptide to interact with T cells. 

In further studies with PBMC from individual 
BCG-immuni zed humans, an influence of HLA type was 
noted on reactivity. Thus, lymphocytes from two 
persons with the HLA-DR4 allele reacted with Peptide 

15 62 (positions 364-408) but not with peptide 30 

(positions 291-305) , whereas cells from three persons 
with the HLA-DRl allele reacted with Peptide 30 but 
not with Peptide 62. 

The above results indicate a genetic, HLA 

20 restriction on the prolieration response. Thus, a 
mixture of preferred peptides or their polymers is 
preferred when assaying an out bred population such 
as humans so that false negative responses can be 
minimi zed . 

25 Another proliferation study was carried out 

with sixteen of the above peptides. The 
proliferating cells here were either one of two T 
cell clones or a polyclonal T cell line. One T cell 
clone came from a tuberculosis patient (AH) and is 

30 designated K8AH. The second T cell clone was 
obtained from a person (JM) vaccinated with 
heat-killed M. leprae and is referred to as A7JM . 
The polyclonal T cell line- (JM) was also obtained 
from the cells of JM that were initially stimulated 

35 with M. bovis BCG , stored frozen and thereafter 
stimulated with a recombinant 65KD protein from 
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M. tuberculosis (Oftung et al . , (1987) J . Immunol . , 
128:927-931] . 

Proliferation was again assayed by the 
[ ^H]-thymidine incorporation method. Here, 

5 autologous PBMC irradiated to inhibit proliferation 
but sufficiently viable to act as antigen-presenting 
cells were added to the cultures of both isolated T 
cell clones and- the cell line along with a 
mycobacterial antigen* After three days of 

10 maintenance, the cultures were pulsed with the 
radiolabel for four hours, harvested and then 
counted . 

The details of this study are provided in 
the Materials and Methods Section. The results are 
15 shown in Table 6, below. 



Table 6 



Protein 540 

2 0 Peptide- Induced Proliferation '*' 

Peptide Prol if er at i ve Response 



Number K8AH A7JM JM 



25 


59* 


0.3 


+ 


0.1 


0.2 


+ 


0.1 


0.3 


+ 


0.2 




10 


0.4 


+ 


0.2 


0.1 


+ 


0 


0.2 


+ 


0 




21 


0.3 


+ 


0 


0.1 


+ 


0 


0.2 


+ 


1 




22 


0.5 


+ 


0 


14.0 


+ 


2.0 


88.2 


+ 


15.8 




23* 


0.2 


+ 


0.1 


0.1 


+ 


0.1 


9.0 


+ 


2. 8 


30 


24 


7.4 


+ 


0.3 


0.1 


+ 


0 


4.1 


+ 


0.1 




25 


0.9 


+ 


0.2 


0.1 


+ 


0 


0.3 


+ 


0.2 




30 


0.4 


+ 


0.1 


0.1 


+ 


0 


0.7 


+ 


0.2 




35 


0.4 


+ 


0.1 


0.4 


+ 


0 


0.3 


+ 


0.1 




52* 


0 . 4 


+ 


0.3 


0 . 2 


+ 


0 


0.3 


+ 


0.1 


35 


43 


0.3 


+ 


0.1 


0.1 


+ 


0.1 


0.2 


+ 


0.1 




46 


0.5 


+ 


0.1 


0.5 


+ 


0 


11 . 4 


+ 


0 . 2 
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49 
53 
54 
63 



0.5 + 0.1 

0.1 + 0 

0.4+ 0.2 

0.4 + 0 



0 +0 
0.2 + 0.1 
0.2 + 0.1 
0.2 - 0.1 



0.4 + 0.3 

0.2+ 0.1 

0.9 + 0.1 

0.4 + 0.1 



-Ag 



0.5 + 0.2 



0.3 + 0.2 



0.4 + 0.2 



30.1 + 2.6 

118.7 + 5.4 

183 . 4 + 19.0 

119 .7 + 15.5 



' See notes of Table 5. 
15 3 Proliferative responses, in cpm x 10~ 3 + 

standard derviation for two or three replicate studies 
using 10 ug/ral of each peptide. Positive values are 
underlined . 

4 Response in the absence of antigen. 
20 ^ Affinity-purified recombinant M. tuberculosi s 

65 KD protein at 50 ug/ml and whole mycobacteria at 20 
ug/ml. 

The results shown in Table 6 illustrate the 
25 clonal specificity for antigens of the screened 

peptides. Thus, T cell clone K8AH, specific to the 
M. tuberculosis complex [Oftung et al . , (1987) J. 
Immunol . , 138 ; 927-931 ] exhibited a proliferative 
response upon stimulation with an inoculum containing 
30 Peptide 24 (231-245) . The T cell clone A7JM, which 
shows cross-reactivity to M. tuberculosis and M. 
leprae , responded to stimulation by admixture and 
contact with an inoculum containing a different 
segment of the 65KD (540 protein) represented by 
35 Peptide 22 (211-225) , but not to inocula containing 
the flanking and overlapping Peptides 21 (201-215) 
and 23 (219-233) . 



M. tuberculosis 







rec. 65KD 5 


4. 


3 


+ 


0 


. 5 


12. 


3 


+ 


1 


.9 


10 


M. 


tuber culosi s^ 


8. 


5 


+ 


1 


. 6 


11. 


5 


+ 


1 


.5 




M. 


bovis BCG 5 


9 . 


4 


+ 


0 


. 7 


21 . 


6 


+ 


2 


. 3 




M. 


leprae^ 


0. 


5 


+ 


0 


.1 


24. 


2 


+ 


2 


. 0 
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Th e JM polyclonal T cell line also 
proliferated in response to contacting with inocula 
containing Peptides 24 and 22. That cell line also 
showed a significant proliferation in response to 

5 admixture and contact with an inoculum containing 
Peptide 23, whose sequence overlaps both of those 
sequences of Peptides 24 and 22, and that was 
predicted to form an amphiphilic helix* Contacting 
the polyclonal T cell line with an inoculum 

10 containing Peptide 46 (positions 451-465) also 
provided significant stimulation. 

The genomic sequence of the 65KD protein of 
M. leprae and the putative translation product of 
that gene have been published. [Mehra et al., (1986) 

15 Proc. Natl. Acad. Sci. USA , 83: 7013-7017 . ] A 

comparison of the 55KD protein amino acid residue 
sequences from M. leprae and M. tuberculos is shows 
the two sequences to be very similar, with only a 
relatively few different residues between them. 

20 T cell clone A7JM had previously been shown 

to proliferate in response to stimulation by both 
whole M. leprae and whole M. tuberculosis . [Mustafa 
et al., (1986) Lepr . Rev. Suppl . , 2 :123 " 130 -1 
Consistent with those findings, clone A7JM also 

25 proliferated when admixed and contacted with an 

inoculum containing Peptide 64, below, whose sequence 
differed from that of Peptide 22 by the conservative 
change of the residue at position 215 from glutamic 
acid of Peptide 22 to aspartic acid. 

30 

(22) AVLEDPYILLVSSKV 
(64) AVLEEPYILLVSSKV. 

T cell clone K8AH is able to discriminate 
35 between M. tuberculois and M. leprae presented for 

stimulation as whole bacilli, and was similarly able 
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to exhibit the same discrimination at the peptide 
level. Thus, the M . tuberculosis -related Peptide 24 
(231-245) could be used to stimulate clone K3AH, 
whereas contact with an inoculum containing Peptide 
5 65, below, having the analogous leprae sequence 

did not stimulate that clone to proliferate. Inocula 
containing Peptide 65 also did not stimulate clone 
A7JM or polyclonal cell line JM . 

10 (24) LLPLLEKVIGAGKPL 

(65) LLPLLEKVIQAGKSL. 

As can be seen from a comparison of the 
above sequences of Peptides 24 and 65, those peptides 

15 differ by the substitution of two residues near their 
carboxy-termini . The glycine (G) at position 240 of 
Peptide 24 is substituted as a glutamine (Q) in 
Peptide 65, and the proline (?) at position 244 of 
Peptide 24 is substituted as a serine (S) in Peptide 

20 65. 

Thus, recognition of Peptide 24 by clone 
K8AJ must be influenced by one or both of glycine-240 
and proline-244. Interestingly, an inoculum 
containing Peptide 25 (241-255) , which contains 

25 proline-244, did not cause stimulation of clone K8AH 
cells when admixed and contacted with those cells. 

A blocking study was carried out to 
determine whether an inoculum containing Peptide 65 
could inhibit the stimulatory response caused by 

30 Peptide 24 on cells of T cell clone K8AH. Those 

studies showed that the M. lepr ae -related Peptide 65 
could not block the response induced by the M. 
tuberculosis - related Peptide 24. This finding again 
implies the criticality of one or both of the 

35 residues at positions 240 and 244 of Peptide 24. 
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Further stimulation studies were carried out 
using M. leprae -r elated and M. tuberculosis pepides 
and the before-mentioned T cell clones and cell 
line. An inoculum containing Peptide 23 caused 

5 stimulation of the polyclonal cell line. The 

sequence of that peptide is identical in both M. 
leprae and M. tuberculosis . (See also Table 6.) 

In addition, two M . lepr ae -related Peptides, 
64 and 66, that each contain a single amino acid 

10 residue substitution as compared to their analogous 
M . tuber culosi s - related Peptides, 22 and 46, 
respectively, also were capable of eliciting 
stimulation of M. lepr ae -immune cell line JM when 
inocula containing one or the other were admixed and 

15 contacted with those cells. Neither M . 

lepr ae - related Peptide 64 nor Peptide 66 stimulated 
cells at T cell clone K8AH. The sequences of Peptide 
66 and of its analogous Peptide 46 are shown below. 

20 (46) IAFNSGLEPGVVAEK 

(66) I A F N S G M E P G V V A E K. 

Each of Peptides 64," 65 and 66 corresponds 
substantially to Peptide 22, 24 and 46, 

25 respectively. That substantial correspondence 

notwithstanding, the results above illustrate that 
there can be differences in reactivities of such 
peptides at the T cell level. 

That different reactivities in T cell 

30 stimulation were found for substantially 

corresponding peptides that differed in sequence is 
not particularly surprising in view of the type of 
interaction thought to be involved in T cell 
stimulation by an antigen as compared to an 

35 antigen-antibody interaction. 
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Thus, an antigen-antibody interaction is 
usally considered to be a relatively simple 
ligand-receptor interaction in which substitutions of 
polar for polar (e.g., glutamic for aspartic in 

5 peptides 22 and 66) or apolar for apolar of about the 
same size (e.g., leucine for methionine in Peptides 
4S and 67) typically are not of great consequence. 
Indeed, it has been shown that for some 
influenza-related 13-residue peptides, drastic 

10 substitutions can occur with little differences being 
noted in binding by a monoclonal antibody raised to 
the parent peptide. See, for example U.5 Patent No. 
4, 631 ,211 . 

T cell stimulation, on the otherhand, is 

15 thought to resemble a sandwich in which the T cell 
and the antigen-presenting or feeder cell are the 
bread and the antigen is the filling. Thus, a 
peptide aritigen must bind to two receptors, one on 
the T cell, and the other, believed to be one or more 

20 proteins of the major histocompatibility complex . 

(MCH) , on the feeder cell. In addition, the binding 
between antigen and each of the T cell and feeder 
cell receptors is thought to be weaker than is the 
usual antigen-antibody binding. It was not therefore 

25 surprising that the glycine to glutamine and proline 
to serine substitutions between Peptides 24 and 65 
resulted in differences in T cell stimulation. 

As noted previously, T cell stimulation can 
be manifest in a number of manners. The previous 

30 discussion has centered primarily on _in vi tro and in 
vivo proliferation assays. The results discussed 
below using T cell clones K8AH and A7JM illustrate 
further manisf es tations of T cell stimulation, and 
manners in which such stimulation can be determined. 

35 Standard assays for secretion of IL-2, 

granulocyte macrophage-colony stimulating factor 
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(GM-CSF) and i nt erf eron-gamma secretion into the 
supernatants of aqueous stimulatory T cell cultures 
were conducted using the above-cloned T cells to 
illustrate stimulation. Cytotoxicity against 
macrophages pulsed with the same stimulatory peptide 
or whole mycobacteria was also assayed- Details of 
these studies are provided in the Materials and 
Methods Section. The results are shown in Table 7, 
below . 

Table 7 
Protein 540 Peptide-Induced 
Stimulatory Responses In T Cell Clones 3 " 



15 



T cell clones' 



IL-2 



GM-CSF 



IFN- 

i 

Gain in a' 



Cytotoxi- 

• ^ 6 
city 



K8AH + APC 



<L0 . 2 



11+12 ( 9%) 



10 



ND 



2 0 K 8 AH + APC + 
Peptide 24 



9 . 4+0.6 



153+30 (>100%) 56 



86 . 5+0 . 6 



K 8 AH + APC + 
M. tuberculosis 7.6+0.2 



87+35 ( 71%) 



44 



85 . 8+2 . 9 



25 



A7JM + APC 0.2 5+8 ( 4%) 5+1 ND 

A7JM APC + 

30 Peptide 22 6.8+0.6 135+4 (>100%) 53+7 82.7+10 

A7JM + APC + 

M. tuberculosis 6.8+0.1 160+14 (>100%) 40 84.7+3 



35 



T cell clones were stimulated by the peptide 
antigen shown hereinbefore to specifically activate each 
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clone. Stimulation was assayed by four methods. T cells 

and antigen-presenting cells .(APC) without antigen were 

used as negative controls in all assays. Results are 

expressed as the mean + standard deviation (where 

5 calculated) of duplicate or triplicate cultures. 
2 

Culture contents in addition to the medium 

are shown in each entry, with plus signs (+) indicating 

the presence of mixed cellular components and antigen 

(peptide or M. tuberculosis ) , where present. 

10 ^ Results expressed in units per ml. 

4 

Assay based on three independent studies 
using three different bone marrow donors. Results are 
expressed in colony-forming units of GM per 2x10^ 
cells. Parenthesized percentages relate to the number of 
15 colonies induced by a GM-CSF positive control supernatant. 

^ Results expressed international units per ml. 
Results expressed as percentages as discussed 
in the Materials and Methods Section. APC + antigen was 
used as a negative control. 
20 . 7 ND = not done . 



The results of Table 7 illustrate further 
standard techniques that are useful in determining 
the presence of stimulated T cells in addition to the 
25 proliferation assays discussed before. 

Assays of T cell clones K8AH and A7JM 
indicated that both showed the helper/inducer 
(T4 + ,T8~) phenotype. Cells of the T4 + 
phenotype are primarily helper cells that recognize 
30 antigen plus class II HLA proteins. Such cells are 
also known to exhibit cytotoxicity as is shown in 
Table 7. 

Tuberculosis is a disease in which the 
cellular portion of the immune response is involved 
35 to the substantial exclusion of the humoral 

(antibody) portion of the immune response. Thus, the 
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ability of the preferred peptide antigens to 
stimulate the T cell clones to not only proliferate 
but to also secrete IL-2, GM-CSF, and interferon 
gamma, each of which constitutes a portion of the 

5 cellular immunity response, indicates that those 

peptides, their polymers, and mixtures thereof, as 
well as the 540 protein (65 KD protein) can play an 
important role in protective immunity. 

That role in cellular immunity is 

10 underscored by the macrophage cytotoxicity exhibited 
by the clones stimulated by the peptides or the whole 
mycobacteria. Similar cytotoxicity for other 
mycobacter ia-reacti ve T cell clones has been 
reported. [Mustafa et al . , (1987), Clin. Exp. 

15 Immunol . , _69 : 255-262 ; Kaufman et al . (1986), Lep . 

Rev . 57 , Suppl . 21:101-111.] However, it is believed 
that the- above results are the first demonstration 
that the same sequence of one protein antigen are 
involved in both T cell help and cytotoxicity. the 

20 i^n vi vo role of such T4 + cytotoxic T cells is 

believed to destroy those macrophages that have 
become incompetent to kill their 
intracellularly- growing mycobacter i a . 

A preferred peptide was previously described 

25 herein as being capable of stimulting 

mycobacteri ally- immune mononuclear cells, and such a 
peptide was said to be useful in assaying for present 
or prior immunological exposure of such cells to 
mycobacteria. A particularly preferred peptide or 

30 its polymer is also capable of immunizing an animal 
for protection against mycobacterial infection such 
as M. tuberculosis . 

Thus, the present invention also 
contemplates a vaccine against mycobacteria such as 

35 M. tuber culosi s that comprises a physiologically 
tolerable diluent containing as immunogen an 
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immunizing effective amount of (i) a peptide whose 
amino acid residue sequence corresponds substantially 
to a particularly preferred T cell-stimulating 
peptide described herein or (ii) a polymer of such a 

5 particularly preferred T cell stimulting peptide as 
described herein. 

Exemplary particularly preferred peptides 
include peptides 22 and 24. Further particularly 
preferred peptides are those whose sequences 

10 correspond substantially to a sequence of the M. 

tuber culosi s 540 protein or another mycobacterial 65 
KD protein and contain 5 to about 40, more preferably 
about 10 to about 20 residues, and still further are 
capable of stimulating proliferation of 

15 mycobacter ially- immune , and for a tuberculosis 

vaccine, M. tuber culosi s - immune , T cells that exhibit 
the T4 + and/or T8 + phenotype. 

Further particularly preferred peptides can 
be obtained by following a procedure similar to that 

20 discussed previously. Polyclonal T -cells from- one -or 
more individals are contacted with an inoculum 
containing a peptide such as one of those of Tables 2 
and 4, and more particularly peptides such as those 
of Tables 5 and 6 that have already been shown 

25 capable of stimulating proliferation of 

mycobacter i ally- immune T cells. The peptides 
inducing proliferation are noted and the 
proliferating T cells are cloned by the limiting 
dilution technique as described by Oftung et a.l . , . 

30 (1987) J . Immunol . , 138 : 927-931 . The phenotypes of 
the proliferated T cells are determined as with the 
OKT series of monoclonal antibodies available from 
Ortho Diagnostic Systems, Inc. of Raritan, N J . One 
or more of the peptides capable of causing 

35 proliferation of T cells having the T4 + and/or 
T8 + phenotype is utilized in the vaccine. 



'/ 
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More preferably, a mixture of peptides, 
polymers having such peptides as repeating units, or 
a polymer whose repeating units are a mixture of such 
peptides that cause proliferation of T4 and/or 

5 T8 + T cells is used. The reason of this preference 
stems from the already noted MHC restriction. In 
addition, there is usually found an MHC restriction 
between T4 + and T8 + T cells, the former 
recognizing antigen plus class II MHC protein and the 

10 latter recognizing antigen plus class I MHC protein. 

Peptides that correspond substantially to 
portions of the 517 protein are also useful herein, 
and are defined as to substantial correspondence 
similarly to those peptides discussed before. The 

15 peptides substantially corresponding to a sequence of 
the 517 protein can contain as few as five residues 
and are therefore somewhat shorter than are the 
shortest of the peptides discussed before. 

Three peptides (denominated 55, 56 and 57) 

20 ■ and their variants- substantially correspond to 
sequences, written from left to right in the 
direction from ainino-t erminus to carboxy- terminus and 
using one letter code, having the formulas 

25 55) N N N I G, 

56) X G N Z G, and 

57) FNSGSGNIG F(I) G N S G 

wherein X is an amino acid residue selected 
30 from the group consisting of F, S, T, L, D and T; Z 
is an amino acid residue selected from the group 
consisting of T, I, L, S and V; and the parenthesized 
residue can replace the residue shown to its left in 
the sequence. Thus, in peptide 57, F and I are 
35 alternative residues. More preferably, X is selected 
from the group consisting of F, S and T; and Z is 
selected from the group consisting of T and I. 
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Using the before-described consensus matrix 
to calculate whether the variant pentapeptides 
defined hereinbefore by the consensus sequence XGNZG 
correspond substantially, one finds that all of those 

5 variants correspond substantially at least at 99% 
confidence level. This can be readily seen by 
determining the greatest differences caused by 
substitutions, then calculating the resultant 
consensus matrix score f and comparing that value to 3 

10 times the number of residues compared, 5, {3x15=15). 

Thus, for the X residue, substituting an 
lie (I) for an Asp (D) residue, or a Ser (S) for a 
Phe (F) provides a value of -3 from the matrix. 
Similarly for Z, substitution of lie (I) for Ser (S) 

15 or Ser (S) for Val (V) provides a value of -2 from 
the matrix. Since two Gly (G) residues and the 
Asn (N) residues are present in any of the before 
compared consensus pentapeptide sequences, the 
presence of those residues provides a score of 22 

20 (3 + 5 + 3 = 22). Subtraction of five [ {- 3 ) + (- 2 ) ] . f or the 
above substitutions" f rom 22 provides a total score 
for the compared pentapeptides of 17. 

Since 17 is greater than 15, any of the 
above substitutions to the consensus sequence 

25 provides pentapeptides that correspond substantially 
at least at the 99% confidence level. Furthermore, 
since the above substitutions caused the greatest 
numerical difference in the total score, any n other of 
the before-discussed substitutions for both X and Z 

30 in the consensus sequence produces a total score; 

i.e., where X is Thr or Leu and z is Thr or Leu, in 
the consensus sequence produces a total score that is 
larger than 17, and consequently, all of those 
pentapeptides also correspond substantially to each 

35 other at least at the 99% confidence level. 
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Peptides 55 and 56 are typically utilized as 
one of a plurality of repeating units of a polymer 
having a relatively low molecular weight; i.e., less 

5 such polymer, or oligomer, contains two of the five 
residue peptides (pent apepti des ) bonded together 
through a peptide bond formed between the 
car boxy- terminal residue of a first pentapeptide 
repeating unit and the amino- terminal residue of a 

10 second pentapeptide repeating unit. 

For example, Peptide 57, above, can be 
viewed as a polymer or oligomer having two such 
pentapeptide repeating units bonded together by a 
peptide bond, and also containing an additional four 

15 residues at the amino-terminus of the oligomer. 

Similar calculations can also be carried out 
for variants of the other peptides disclosed herein 
as one means of determining whether a peptide with a 
different sequence from one of those specifically 

20 enumerated. corresponds substantially to a 

specifically enumerated - peptide , or to a portion 
thereof. For the purposes of epitope-paratope 
interactions, sequences containing at least five 
residues are the shortest sequences that should be 

25 compared since at least five or six residues appear 

to be required for epitope-paratope interaction. See 
for example, Elder et al . (1987) J. Virol . 61: 3-15; 
Atassi (1975) Immunochemi s try 1^2:423-438; and 
Benjamini et al . (1969) Biochemistry 3:2242-2246. 

30 Similarly, the sequence in isolated form 



NNNIGNNNIGNNNIG 



35 



that is also present at nucelotide positions 3270 
through 3226 of Figure 2 can be considered a 
polymeric or oligomeric trimer of the sequence of 
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peptide 55. Likewise, an isolated form of the 
sequence from nucleotide position 3210 through 
position 3107 can be viewed a polymer or oligomer 
that contains eight XGNZG pentapept i des repeated. 
Each of above polymers or oligomers contains a 
plurality of the pentapeptide repeating units bonded 
together by peptide bonds. 

Solid phase peptide synthesis techniques, as 
are described in the before-discussed U.S. Patents 
whose disclosures are incorporated herein by 
reference, are typically the most useful means of 
preparation for oligomers and polymers containing up 
to a total of about forty total residues (eight 
repeating pentapeptide units). 

Genetic engineering techniques as are 
described herein are particularly useful for 
preparing larger polymers that contain more than 
about eight pentapeptide repeating units. For 
example, a double stranded DNA molecule having the 
sequence shown. in Figure. .2 from nucleotide position 
2959 through nucleotide posi tion . 3303 , and in phase 
with the illustrated amino acid residue sequence of 
protein 517" can be excised from the larger molecule 
shown in Figure 2 or synthesized from appropriate 
deoxyribonucleic acid derivatives using known 
techniques, and thereafter ligated into an 
appropriate plasmid vector for expressing a peptide 
polymer that corresponds substantially in sequence to 
the polymer containing the pentapeptide repeating 
units shown beneath the sequence at those positions 
in Figure 2. 

Higher molecular weight polymers; i.e., with 
average molecular weights of about 10,000 to 
1,000,000, or more, containing one or more of the 
above 540 protein or 517 protein pentapeptide 
repeating units can also be prepared by oxidatively 
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polymerizing a peptide that is terminated with 
cysteine (Cys; C) residues, or a " diCys- termi nated" 
peptide. The resulting polymer thereby contains its 
repeating units DOuueu Luyemci. oxiui ^eu cpi.cine 

(cystine) disulfide bonds. 

For example, each of the before-discussed 
540 protein peptides or 517 protein pentapept ides can 
be synthesized to contain an additional Cys residue 
at each of the amino- and car boxy-termini to provide 
diCys- terminated peptides in their reduced forms. 
After synthesis, in a typical laboratory preparation, 
10 milligrams of the diCys peptide (containing 
cysteine residues in un-oxidized form) are dissolved 
in 250 milliliters (ml) of 0.1 molar (M) ammonium 
bicarbonate buffer. The dissolved diCys- termi nated 
peptide is then air oxidized by stirring the 
resulting solution gently for period of about 18 
hours in the air, or until there is no detectable 
-free mercaptan by the Ellman test. [See Ellman , 
Arch. Biochem. Biophys ., _82:70-77 (1959).] 

The polymer so prepared contains a plurality 
of the synthetic, peptide repeating units that are 
bonded together by oxidized cysteine (cystine) 
residues. Such polymers typically contain their 
peptide repeating units bonded together in a 
head-to-tail manner as well as in head-to-head and 
tail-to-tail manners; i.e., the ami no-termini of two 
peptide repeating units can be bonded together 
through a single cystine residue as can two 
car boxyl- termini since the linking groups at both 
peptide termini are identical. 

A 517 protein pentapeptide repeating unit 
can itself be contained in the form of an oligomer 
containing up to about eight pentapeptide repeating 
units, or in a shorter peptide such as the 14-residue 
Peptide 57. Still further, a genetically engineered 
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polypeptide such as that prepared from the DNA 
sequence of nucleotides at positions 2959 through 
3303' that has been further engineered to include" 
codons for Cys (TGT or TGC ) at the 5'- and 3' -ends 
can also be polymerized. 

The molecular weight of such a polymer can 
be controlled through the addition of chain- 
terminating reagents. Exemplary chain terminating 
reagents are cysteine itself and a peptide such as a 
before-described pentapeptide that further includes 
single Cys residue, preferably at a terminus. 

The full names for individual amino acid 
residues are sometimes used herein as are the 
well-known three letter abbreviations. One letter 
abbreviations (code) is also utilized. The Table of 
Correspondence, below, provides the full name as wel 
as the three letter and one letter abbreviations for 
each amino acid residue named herein (See, for 
example, L. Stryer, Biochemistry , 2nd ed . , W. H . 
Freeman and Company, San Francisco, (1981), page 
16) . The amino acid residues utilized herein are in 
the natural, L, form unless otherwise stated. • 

Table of Correspondence 

Three letter One letter 



Amino acid abbr evi ation symbol 

Alanine Ala A 

Arginine Arg R 

Asparagine Asn N 

Aspartic acid Asp D 

Asparagine or aspartic acid Asx B 

Cysteine Cys C 

Glutamine Gin Q 

Glutamic acid Glu E 

Glycine Gly G 

Histidine His H 
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louicuoxiic 


Tie 

X J- w 


x 


Li e uc ine 


LlCU 


T 
Jj 


T uci n ^ 
jj y ojl uc 




K 


Methi oni ns 


X v i e u 


M 


Phpnvl alanine 


Phe 


F 


Prol ine 


Pro 


P 


Serine 


Ser 


S 


Threonine 


Thr 


T 


Tryptophan 


Trp 


W 


Tyrosi ne 


Tyr 


Y 


Val ine 


Val 


V 



III . MATERIALS AND METHODS 
A . Recombinant studi es 

15 1 . Bacteria, Phage and Plasmids 

^ . col i strains used in this work were 
BNN97 [Young et al . , (1983) Science , 222 : 778-732; 
ATCC 37194]; JM83 [ Yani sch-Per ron et al . , (1985), 
Gene , 33: 103-119; also ATCC 35607]; JM101 

20 [Yanisch-Perron et .al . , (1985) ,, Gene , 33: 103-119 ; 
also ATCC 33876] ; Y1089 [Young et al • , (1983), 
Science , 222 : 773-782; also ATCC 37196]; and Y1090 
[Young et al . , (1983), Science , 222 : 778-732; also 
ATCC 37197]. Plasmids pUC19 [Yanisch-Perron et al . , 

25 (1985), Gene , 3J3:103-119] and pMC1871 [Shapira et 
al . , (1983), Gene , _25: 71-82] were purchased from 
Pharmacia Fine Chemicals, Piscataway, N J . The 
recombinant DNA library of tuber culosi s genomic 
DNA fragments in the ^ gtll vector was constructed by 

30 R. Young et al . (1985), Proc. Natl. Acad. Sci, USA , 
_82: 2583-2587 , and made available through the World 
Health Organization's Program for Research in the 
Immunology of Tuberculosis. Recombinant phage 
,ARY3143 and ARY3146 were generously provided by 

35 R.A. Young [Whitehead Institute, M.I.T.; Young et 
al . , (1985), Proc. Natl. Acad. Sci. USA, 
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J32: 2583-2587] . Subclones of the mycobacterial DNA 
inserts in these recombinant phage were constructed 
in pUC19 or M13mp9 [Messing et al . , (1982), Gene , 
19:269-276; Ml3mp9 is listed for sale in the August, 

5 1983 catalog of Bethesda Research Laboratories, Inc.] 
vectors using standard recombinant DNA techniques 
[Maniatis et al . , (1982), Molecular Cloning - a 
laboratory manual , Cold Spring Harbor Laboratory, 
Cold Spring Harbor, NY]. 

10 2 . Anti sera 

Monoclonal antibodies specific for the 65XD 
antigen were obtained from the Immunology of 
Tuberculosis Scientific Working Group under a grant 
from the WHO/World 3ank/UNDP Special Program for 

15 Vaccine Development. These antibodies included IT- 13 
[WTB-78; Coates et al . , (1981), Lancet, J2 :1 67-169]; 
IT-31 [SA2D5H4; T. Buchanon , unpublished] and IT-33 
[MLIIH9; Gillis et al . , (1982), Infect. Immun . , 
37_:172«178] • Ant i-bet a-galactosidase antibodies were 

20 purchased from- Cooper biomedical , Malvern, PA. 
Polyclonal rabbit antisera directed against a 
sonicate of M. tuberculosis strain H37Rv were 
elicited as previously described [Minden et al . , 
(1984), Infect. Immun. , 45: 519-525] - 

25 3. Immunoscr eening of Agtll-M. 

tuber cul os i s Library 

Clones reactive with the monoclonal 
antibodies specific for the 65KD antigen were 
isolated essentially as described by Young et al . 

30 [Young et al . , Proc. Natl. Acad. Sci . USA , 

_32: 2583-2587] • Briefly, for each 150 mm LB plate, 
0.6 ml of a fresh overnight culture of Y1090 cells 
were infected with l-2xl0 5 plaque-forming units 
(pfu) of the library. After 3.5-4 hours of growth at 

35 42°C, the plagues were overlaid with a dry 

nitrocellulose filter that had been saturated with 10 
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millimolar (mM ) i sopropyl-bet a-D- thiogalactopyr anoside 
(IPTG; available from Sigma Chemical Co.). The 
plates were incubated an additional 3.5-4 hours at 
37°C and then removed to room temperature and the 

5 position of the filters marked. 

The filters were washed briefly in T3ST [50 
mM Tris-HCl, pH 8, 150 mM NaCl, 0.05% Tween 20 
[ polyoxyethylene (20) sorbitan monolaurate] ] and then 
incubated in TBST plus 20% fetal calf serum. After 

10 30 minutes at room temperature, the filters were 
transferred to TBST plus antibody. 

For the initial screen, the antibody mix 
contained a 1:1000 dilution of admixed IT-13, IT-31, 
and IT-33. The filters were incubated with the 

15 antibody solution overnight at 4°C with gentle 
agitation, washed in TBST and reacted with 
biotinylated goat anti-mouse immunoglobulin, the 
Vectastain ABC reagent, and developer as described by 
the manufacturer (Vector Laboratories, Burlingame, 

20 CA) . After the color had developed, the filters were 
washed with several changes of water and air dried. 

Phage corresponding to positive signals were 
twice plaque-purified. To determine which monoclonal 
antibodies reacted with which of the recombinant 

25 phage, about 100 pfu of a purified phage stock were 

inoculated in a small spot on a lawn of Y1090 E. col i 
on an LB (Luri a-Ber tani broth) plate. The phage were 
allowed to grow and induced to synthesize the foreign 
proteins as described above. The filters were then 

30 reacted with a 1:1000 dilution of one of the 

monoclonal antibodies. The subsequent steps were the 
same as for the initial screen. 

4 . Western Blot Assays 

Cells containing phage or plasmids in which 
35 the expression of the foreign sequences was under the 
control of the col i lac gene regulatory sequences 
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w.ere induced to synthesize the foreign proteins by 
incubating the cells in the presence of 2.5 mM IPTG 
for 2 hours. Crude lysates of cells expressing 
^ gtll recombinants were made as described in Huynh 

5 et al; (1985)/ DNA Cloning Techniques: A Practical ; 
Gover , ed., IRL Press, Oxford, Vol. I, pp. 49-78. 
Briefly, those lysates were made by harvesting cells 
from overnight cultures and resuspending the cells in 
10 mM Tris pH 7.5, 10 mM EDTA containing 100 ug 

10 lysozyme/ml. After 10 minutes at room temperature, 
sodium dodecyl sulfate (SDS) was added to a final 
concentration of 0.5%. A protease inhibitor 
(Trasylol, Boehringer Mannheim, Indianapolis, IN) was 
added to all crude lysates at a final concentration 

15 of 0.03%-0.3%. 

The crude protein preparations were 
electrophor esed on 10% polyacrylamide-SDS Laemmli 
gels [Laemmli, (1970) Nature , 227 : 680-635] , and the 
separated proteins electrophor eti cally transfered to 

20 nitrocellulose [Towbin et al.-, (1979), Proc . Natl. 
Acad . Sci . USA , 7_6 : 43 50-43 54 ] . The immobilized 
proteins were reacted with a 1:1000 dilution of 
monoclonal antibody IT-13 in TBST overnight at 4°C. 
The nitrocellulose filters were then washed, reacted 

25 with peroxidase-conj ugated goat anti-mouse 

immunoglobulin, and developed as previously described 
[Niman et al . , (1983), Proc. Natl. Acad. Sci. USA , 
80:4949-4953] . 

5 . Nucleic Acid Sequencing 

30 The sequences of 5 1 -end-labeled restriction 

fragments of the mycobacterial DNA were determined by 
a modification of the partial chemical degradation 
technique of Maxam and Gilbert [Brow et al . , (1985), 
Mol . Biol. Evol ,, 2 > :1 ~ 12 ? and Maxam et al . , (1975), 

35 Proc. Natl. Acad. Sci. USA , _74 : 560-564] . For the 

Ml3/dideoxy sequencing studies, Sau3AI fragments from 
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the mycobacterial DNA inserts were subcloned into the 
BamHI site of M13mp9. Phage DNA was isolated from 
the Ml 3 recombinants and subjected to the dideoxy 
chain termination sequencing reactions [Biggin et 

5 al . , (1983), Proc. Natl. Acad. Sci . USA , 

_80: 3963-3965; and Sanger et al . , (1980), J . Mol . 
Biol . , 143 : 161-178] . The products of the sequencing 
reactions were electrophor esed on 6% acr ylami de/7M 
urea/0. 5-2. 5xTBE gradient sequencing gels, [Biggin, 

10 (1983), Proc. Natl. Acad. Sci. USA , 80: 3963-3955] . 

The gels were dried under vacuum and exposed to Kodak 
XRP-1 film. The nucleotide sequences were determined 
i ndependantly for both strands of the mycobacterial 
DNA. 

15 Computer-aided analyses of the nucleic acid 

sequences and deduced protein sequences were 

performed using the databases and programs provided 

by the Nucleic Acid and Protein Identification 

Resources of the National Institutes of Health as 

20 well as the programs of Chow et al . , (1978) Adv . 

Enzym . , _47: 45-148 and Hopp and Woods [Hopp et al . , 

(1981), Proc. Natl. -Acad. Sci . USA , 73: 3824-3828] . 

6 . Bet a- gal act os idase assays 

Cells were grown in B broth or B broth plus 

25 2.5 mM IPTG to an optical density at 600 nanometers 

(OD g00 ) of about 0.3. Crude lysates were made, and 

bet a-galactosidase was activity assayed as described 

n 

by Miller (1972) , Experiments in Molecular Genetics , 
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY. 
30 7. Capacity of Recombinants 

to Elicit DCH 

a . DCH Assays 

Studies were carried out to determine 
whether the recombinant proteins or purified protein 
35 derivative (PPd) (Connaught Laboratories, Ltd., 

Willowdale, Canada) would elicit DCH reactions in 
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Hartley guinea pigs that had been immunized with 
sonicates of M. tuberculosi s , M. bovi s or saline. 
Groups of guinea pigs were given three weekly 
intramuscular (i.m.) injections of sonicates 

5 suspended in incomplete Freund's adjuvant (IFA) as 
the physiologically tolerable diluent. Each 
injection contained 1.0 milligram (mg) of protein. 
Some animals received a fourth injection so that one 
week after the final injection, all animals were 

10 tested intr adermally (i.d). Test antigens included 

the crude and partially purified recombinant extracts 
as well as saline and PPd as controls. Test antigens 
were used at 1-10 ug diluted in 100 ul of 
phosphate-buffered saline at a pH value of pH 7.0 

15 (PBS) , containing 0.0005% Tween 20 as the 

physiologically tolerable diluent. Groups of 
unimmunized guinea pigs were similarly tested. All 
i.d. injections were administered into shaved areas 
on guinea pig flanks. Reactions were read at 24, 48 

20 and 72 hours, and were considered positive when the 
diameters of erythema and indurated areas exceeded 
10 mm. 

b • Preparation of Crude Lysates 
E. coli containing a plasmid or lambda phage 
25 of interest were grown by incubation at 37 degrees C 
with aeration in 3-broth to late phase in which 
absorbance at 600 nanometers (AgQg) was between 
approximately 0.4 and 0.6. IPTG was then added to a 
final concentration of 10 mM and the bacteria were 
30 further incubated for two hours. 

The bacterial culture was then chilled on 
ice for 10 minutes and the cells were harvested by 
cent r i f ugation at 6000 rpm for 10 minutes. The 
resulting cell pellet was washed once in TBS (50 mM 
35 Tris, pH 8, 150 mM NaCl) by resuspension and 

recentrif iguation , and was thereafter resuspended 
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(Sigma Chemical Co., St. Louis, MO) in a volume of 
TBS with 0.5 molar sucrose equivalent to 1/10 the 
original culture volume. Lysozyme was added to the 

— i- * ^^niT^^nn^o^ en 1 tl f l nn ^ O 3 final 

5 concentration of 50 ug/ml, and that admixture was 

incubated for 10 minutes at 37 degrees C. Cells were 
harvested by centr i f ugation and were resuspended in 
an equal volume of TBS . Thereafter, DNAse , Trasylol 
and SDS (Sigma) were added to the resulting admixture 

10 such that the final concentrations were 1 ug/ml, 0.1% 
and 1%, respectively. That admixture was further 
incubated at room temperature for a time period of 10 
minutes with periodic mixing to effect completion of 
cell lysis. The resulting crude lysate was stored at 

15 -20 degrees C until use. 

c. Partial Purification of 
Expressed 65KD Protein 

Proteins containing the 65KD antigens were 
partially purified from crude lysates of E. col i 

20 expressing that protein by differential ammonium 

sulfate precipitation. To that end, a crude lysate 
was first combined with a solution of saturated 
ammonium sulfate (SAS)' to give a final concentration 
of 30% of the original lysate concentration. 

25 Precipitation was effected as is well known in the 
art, and the resulting supernate was retained. The 
supernate was then combined with SAS to give a 
concentration of 50% of that of the original lysate, 
and precipitation effected again. The resulting 

30 pellet was retained, resuspended in PBS and dialysed 
against PBS. This resulting dialysed material is 
referred to as partially purified. 

d. Preparation of Extracts 
of M . tuberculosis 

35 M. tuberculosis strain H37Rv and M. bovis 

strain BCG were obtained from the culture collection 
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of the National Jewish Hospital and Research Center, 
Denver, CO, and grown as previously described [Minden 
et al. # (1977) Science, 176 : 57-58 and Minden et al., 
(1972) Infect, Immun . , _6: 574-582]. 

5 Bacteria were then heat-killed and broken by 

sonication with ultrasonic treatment until, by 
microscopic examination, greater than 95% of the 
cells were disrupted. These disrupted bacteria were 
then subjected to ul tr acentr i f ugat ion at 200,000xg 

10 for a time period of 2 hours, and the supernate was 

retained. The supernates so obtained are referred to 
as H37RV-S and BCG-S, repectively, and their 
antigenic and biological characteristics have been 
descr ibed . 

15 3 . Peptide Studies 

1 . Mycobacter ial antigens 
Armadillo derived killed M. leprae was 

supplied by Dr. R. J. W. Rees, Mill'Hill in London, 
from the IMMLEP (WHO) bank. M. tuberculosis and 

20 M. bovis BCG were kindly donated by Dr. Eng , National 
Institute of Public Health, Oslo, Norway, Bacilli 
were killed by irradiation (2.5 m rad) . Recombinant 
M . tuberculos is 65 KD antigen, expressed from^gtll 
as a beta-galactosidase fusion protein, were purified 

25 from E. coli lysates prepared as described in Oftung 
et al., (1987) J. Immunol ., 138 : 927-931 on a high 
affinity anti-beta-galactosidase column (Promega 
Biotech, Madison, USA). 

2 . Synthetic peptides 

30 The protected peptide resins were prepared 

by usual Merrifield solid phase techniques in groups 
of 100 by the method of Simultaneous Multiple Peptide 
Synthesis [Houghten, (1985) Proc. Natl. Acad. Sci. 
USA , 82: 5131-5135; and Houghten et al., (1985) Inter . 

35 J . Pept . Prot . Res . , 27: 673-678], and were cleaved 

twenty-four at a time in a new multi-vessel apparatus 
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[Houghten et al . , (1986) Biotechni ques , _4: 522-523]. 
Each synthesis resulted in the generation of 50-75 mg 
of peptide. Typical purities of the crude peptides 
om 65-95%. 

5 3 . T-cell clones and lines 

The T-cell clone K8AH from a tuberculosis 
patient (AH) and the T-cell clone A7JM from a killed 
M. leprae - vaccina ted person (JM) were established by 
the limiting dilution technique as described [Oftung 

10 et al . , (1987) J. Immunol ., 138 : 927-931 ] . The T-cell 
line was raised from peripheral blood mononuclear 
cells (PBMC) of the donor JM by stimulation of 
2xl0 6 P3MC/ml in complete medium (RPMI 1640 + 15% 
AB serum + 1% penicillin and streptomycin) with 

15 M . bovis BCG (20 ug/ml ) in 24-well Costar plates. 

After 6 days of incubation at 37°C in an atomosphere 
of 5% CO^ snd 95% air, anti gen- r eacti ve cells were 
expanded by adding 100 U/ml recombinant IL-2 two 
times per week. After long term storage in liquid 

20 nitrogen, T cells were propagated iri vitro by 

stimulation of 2xl0 5 cells/ml in 24 well Costar 
plates with whole bacilli as- antigen (20 ug/ml) in 
the presence of 10^ irradiated .autologous PMBC as 
feeder (antigen- pr esenting ) cells and recombinant 

25 IL-2 (100 U/ml) . Efficient expansion of clones and 
lines was achieved by stimulation with antigen and 
feeder cells once and IL-2 twice per week. 
Determination of T cell subsets was performed as 
previously described [Oftung et al . , (1987) J_i 

30 Immunol . , 138 : 927-931] . 

4. Pepti de-Induced T Cell 

Clone Stimulation Assays 
The following assays were carried out for 
the inventors by Dr. Frederik Oftung, at the 

35 Laboratory for Immunology, Norwegian Radium Hospital, 
Oslo, Norway. Initial assays for T cell stimulation 
were carried out using coded samples. 
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a. Antigen-Induced Proliferation 
of T-cell Clones and Lines 

Clonal (IxlO 4 ) or polyclonal (2xl0 4 ) T 
cells and irradiated autologous PBMC (IxlO 5 ) were 

5 distributed to each well of 96-well flat bottom 
Costar plates. Mycobacterial antigens as whole 
bacilli, recombinant antigens as affinity-purified 
material or synthetic peptides were then added in 
triplicates or duplicates. The total culture volume 

10 was kept at 200 microliters (ul) . 

After 72 hours of incubation, the cultures 
were given a 4 hour pulse of 0.045 mBq 
[ 3 H1 -thymidine (specific activity = 135xl0 3 
mBq/mM) . Cells were then harvested and radioactivity 

15 incorporated was determined by liquid scintillation 

counting [Mustafa et al., (1933) Clin. Exp. Immunol . , 
_52: 29-371 . 

The results are expressed as mean 
(triplicates or duplicates) values of counts per 

20 minute (cpm) . Cells were considered to be 

proliferating in response to a given antigen where 
cpm in cultures with antigen minus cpm in cultures 
without antigen was more than 1000 and cpm in 
cultures with antigen divided by cpm in cultures 

25 without antigen was more than 2. 

b. Lymphokine Production and Assay 
T-cell clones (2xl0 5 cells/ml) were 

distributed to wells of 24-well Costar plates with 
adherent cells from IxlO 5 irradiated autologous 

30 PBMC plus antigen at optimal concentrations. Cell 

free supernatants were collected after 16 or 48 hours 
of incubation and stored at minus 20°C until assayed 
for lymphokine activities. IL-2 activity in 
supernatants harvested after 16 hours of incubation 

35 was determined by their ability to stimulate an 
IL-2-dependent mouse T-cell clone (CTLL 2) to 
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proliferate [Mustafa et al., (1985) Clin. Exp, 
Immunol . , 52_: 474-481 ] . Granulocyte macrophage-CSF 
(GM-CSF) activity in the same supernatants was 
assayed by the capacity of the supernatants to induce 

5 colony formation in mononuclear bone marrow cells 

[Dahl et al., (1972) Acta Pathol. Microbiol. Scand. 
Sect. 3 , 8_0; 863-8701 . Supernatants harvested after 
48 hours were used to determine inter feron-gamma 
activity by the method of Dahl and Degree [ Acta 

10 Pathol. Microbiol. Scand. Sect. 3 , S0_: 863-870] , using 
human embryonic lung fibroblasts and vesicular 
stomatitis virus as the challenge virus. 

c . Cytotoxicity assay 
Adherent cells from 1x10** autologous 

15 irradiated PBMC in 24-well Costar plates were pulsed 
with antigens at optimal concentrations and the 
density of T cell clones was adjusted to IxlO 5 
cells/well. After 7 days of incubation, T cells were 
washed off, and 0.5 ml of 0.03% neutral red (in 

20 saline + 10% FBS) were added to each well and the 
plates incubated for 30 minutes. Neutral red was 
then removed from the wells by washing, and the dye 
taken up by macrophages was released by adding 0.5 ml 
of 0.05 M acetic acid in 50% ethanol [Parish et al., 

25 (1983) J. Immunol. Methods , 58:225]. Percentage 

cytotoxicity was calculuated from spectrophotometr ic 

absorbance measurement at 540 nanometers [0D P1A 1 

54 0 

according to the formula: 

30 Cytotoxicity (%) = OD 540 con ' " OD 540 stud Y 

OD 54(J con. 

where OD 54Q con . = OD 54Q of control 
cultures with adherent cells + T-cell clones; and 

35 OD 540 study = OD 54Q of study cultures with 
adherent cells + T-cell clones + antigen. 
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5 . Peptide-Induced Pooled T Cell Stimulation 
Stimulation assays of pooled human T cells 
were carried out for the inventors by Dr. Stefan 
Kaufman of the Max Plank Institute for Immunology, 

5 Freiberg, West Germany. Again, coded samples were 
supplied for the assays. 

The assay procedure was as follows. 
Mononuclear cells were isolated from peripheral blood 
of M. bovis BCG -vaccinated persons on F icoll-Hypaque 

10 gradients [Emmerich et al., (1986) J . Exp. Med . , 

163 : 1024-1029 ; and Boyum, (1958) Scand. J. Clin. Lab. 
Invest . , 21 (Suppl. 97): 311, and were used to seed 
wells of a 96-well microtiter plate at about 2xl0 5 
cells/well. Antigen was then added at 0.1 ug/ml, 1 

15 ug/ml and 10 ug/ml. 

After six days of culture, 1 microCurie 
(uCi) of [ 3 H] -thymidine was added to each well. 
Eighteen hours later, cultures were harvested on 
glass fiber filters. Thymidine incorporation was 

20 measured in a liquid scintillation counter. 

For the assays of Table 5, the PBMC were 
pooled. For the studies conducted related to HLA 
restrictions, the PBMC were kept separate and the HLA 
alleles ascertained by standard techniques. 

25 The present invention has been described 

with respect to preferred embodiments. It will be 
clear to those skilled in the art that modifications 
and/or variations of the disclosed subject matter can 
be made without departing from the scope of the 

30 invention set forth herein. 
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WHAT IS CLAIMED IS: 

1. An isolated DNA molecule that consists 
essentially of the nucleotide sequence, from right to. 
left and in the direction from 5'-end to 3' -end, that 

5 corresponds substantially to the sequence represented 
by the formula in Figure 2 from about position 3950 
to about position 2390 f and in a consistent reading 
frame coding for a 517 amino acid residue protein of 
Mycobacterium tuberculos is . 

10 2. The DNA molecule of claim 1 having the 

sequence that consists essentially of the nucleotide 
sequence, from right to left and in the direction 
from 5'-end to 3 f -end, corresponding to the sequence 
represented by the formula in Figure 2 from position 

15 3943 through position 2398. 

3. A plasmid vector comprising a replicon 
operationally linked to a foreign DNA sequence that 
corresponds substantially to a DNA molecule of 
claim 1, said vector being capable of replicating 

20 said foreign DNA sequence in a replication/expression 
medium. 

4. The plasmid vector of claim 3 wherein 
said replication/expression medium is a unicellular 
organism. 

25 5. The plasmid vector of claim 3 further 

including sequence-encoded signals for initiation and 
termination of transcription that are operationally 
linked to the 5'-end and the 3'-end, respectively, of 
said foreign DNA sequence and compatible with said 

30 replication/expression medium for transcribing a 

product coded for by said foreign DNA sequence and 
expressing a protein product coded for by said DNA 
sequence . 

5. The protein produced by expression of 
35 the protein coded for by said foreign DNA of said 

plasmid vector of claim 5. 
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7. A bacterial culture comprising bacteria 
that contain the plasmid vector of claim 5 in an 
aqueous medium appropriate for the expression of the 
517 amino acid residue protein of Mycobacter ium 
tuberculosis . 

8. A method of producing a 517 amino acid 
residue protein of Mycobacter ium tuberculosis 
comprising the steps of: 

(a) culturing a replication/expression 
medium containing a plasmid vector for replicating 
and expressing a foreign DNA sequence contained 
therein, said vector containing a foreign DNA 
sequence corresponding substantially to the DNA 
molecule of claim 1, said vector additionally 
containing operatively linked nucleotide sequences 
regulating replication and expression of said foreign 
DNA sequence, said culturing being carried out under 
conditions suitable for expression of the protein 
encoded by said foreign DNA sequence; and 

(b) harvesting the expressed protein that 
is encoded by said DNA sequence* 

9, The method of claim 10 wherein said 
replication/expression medium is a culture of 
unicellular organisms. 

10, A method for determining previous 
immunological exposure, of a mammalian host to 
Mycobacter ium tuberculos is or Mycobacter ium bovis 
comprising the s*2eps of: 

(a) administering intr adermally to an 
assayed mammalian host an inoculum that consists 
essentially of the purified 540 amino acid residue 
protein encoded for by the DNA sequence of Figure 2 
or an immunologically active portion thereof, said 
protein dissolved or dispersed in a physiologically 
tolerable diluent and present in said diluent in an 
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amount effective to induce erythema and induration in 
a mammalian host previously immunized with 
M. tuberculosis or M. bovis ; 

(b) maintaining said mammal for a time 
period of about 24 to about 72 hours; and 

(c) assaying for the presence of erythema 
and induration at the site of intradermal 
administration at the end of said time period. 

11. The method of claim 10 wherein said 
purified protein is a recombinant protein. 

12. The method of claim 11 wherein said 
purified protein is a recombinant fusion protein that 
contains a portion of a beta-galactos idase molecule 
bonded to the amino- terminus of an immunologically 
active portion of said 540 amino acid residue 
protein . 

13. An inoculum consisting essentially of 
the purified 540 amino acid residue antigen coded for 
by the sequence of Figure 2 or an immunologically 
active portion thereof dissolved or dispersed in a 
physiologically tolerable diluent and present in said 
diluent in an amount effective to induce erythema and 
induration in a mammalian host previously immunized 
with M. tuberculosis or M. bovis . 

14. The inoculum of claim 13 wherein said 
protein antigen is a recombinant protein. 

15. The inoculum of claim 14 wherein said 
recombinant protein further includes a portion of the 
beta-galactosidase molecule bonded to the 
amino-terminus of an immunologically active portion 
of said 540 amino acid residue protein. 

16. A peptide that consists essentially of 
a 5 to about 40 amino acid residue sequence that 
corresponds substantially to a sequence of the 540 
amino acid residue protein or the 517 amino acid 
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residue protein coded for by the DNA sequence of 
Figure 2. 

17. The peptide of claim 16 wherein said 
sequence corresponds substantially to the sequence of 
the 540 amino acid residue protein represented by a 
formula written from left to right and in the 
direction from amino- terminus to carboxy- termn inus , 
selected from the group consisting of 
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V 


T 


R 


S 


A 


L 


Q 


N 


A 


A; 


L 


Q 


N 


A 


A 


5 


I 


A 


G 


L 


F 


L 


m 


T 


E ; 


F 


L 


T 


T 


E 


A 


V 


V 


A 


D 


K 


P 


E 


K 


E ; 


K 


P 


E 


K 


E 


K 


A 


S 


V 


P 


G 


G 


G 


D 


M; 


K 


A 


S 


V 


P 


G 


G 


G 


D 


M 


G 


G 


M 


D 


F. 
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18. The peptide of claim 16 wherein said 
sequence corresponds substantially to the sequence of 
the 540 amino acid residue protein represented by a 
formula written from left to right and in the 
direction from amino- terminus to carboxy- terminus , 
selected from the group consisting of 



A 


V 


L 


E 


D 


P 


Y 


I 


L 


L 


V 


S 


S 


K V; 






L 


L 


V 


S 


S 


K 


V 


S 


T 


V 


K 


D 


L 


L P; 






L 


L 


P 


L 


L 


E 


K 


V 


I 


G 


A 


G 


K 


P L ; 






A 


I 


L 


T 


G 


G 


Q • 


V 


I 


S 


E 




V 


G L; 






I 


A 


F 


N 


S 


G 


L 


E 


P 


G 


V 


V 


A 


E K; 






A 


R 


R 


G 


L 


E 


R 


G 


L 


N 


A 


L 


A 


D A V 


K 


V; 


E 


K 


I 


G 


A 


E 


L 


V 


K 


E 


V 


A 


K 


K; 






G 


L 


K 


R 


G 


I 


E 


K 


A 


V 


T? 


K 


V 


T E T 


L 


; a 


I 


E 


D 


A 


V 


R 


N 


A 


K 


A 


A 


V 


E 


E G. 







15 



19. The peptide of claim 16 wherein said 
sequence corresponds substantially to the sequence of 
20 the 517 amino acid residue protein represented by a 
formula written from left to right and in the 
direction from amino- terminus to carboxy- termninus , 
selected from the group consisting of: 

2 5 N N N I G, 

X G N Z G, and 

FNSGSGNIGF(I) GNSG 

wherein X is an amino acid residue selected 
30 from the group consisting of F, S, T, L , D and I; Z 
is an amino acid residue selected from the group 
consisting of T, I, L, S and V; and the parenthesized 
residue can replace the residue shown to its left in 
the sequence. 

35 
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20. A polymer comprising a plurality of 
peptide repeating units, said peptide repeating units 
consisting essentially of an amino acid residue 
sequence that corresponds substantially to a sequence 

5 of a 65KD mycobacterial cell wall protein-a f said 
peptide repeating units having the capacity of 
stimulating T cells immune to the mycobacteria of 
said 65KD mycobacterial cell wall protein-a. 

21. The polymer of claim 20 wherein said 
10 mycobacter ium is M. tuberculos is or M. bovis . 

22. The polymer of claim 21 wherein said 
peptide repeating units consist essentially of a 
sequence, written from left to right and in the 
direction from amino- terminus to carboxy-terminus, 

15 represented by a formula selected from the group 
consisting of 





A 


V 


L 


E 


D 


P 


Y 


I 


L 


L 


V 


S 


S 


K V; 






L 


L 


V 


S 


S 


K 


V 


S 


T 


V 


K 


D 


L 


L P; 




20 


L 


L 


P 


L 


L 




K 


V 


I 


G 


A 


G 


K 


P L; 






A 


I 


L 


T 


G 


G 


Q 


V 


I 


S 


E 


E 


V 


G L; 






I 


A 


F 


N 


S 


G 


L 


E 


P 


G 


V 


V 


A 


E K ; 






A 


R 


R 


G 


L 


E 


R 


G 


L 


N 


A 


L 


A 


D A V 


K V; 




E 


K 


I 


G 


A 


E 


L 


V 


K 


E 


V 


A 


K 


K; 




25 


G 


L 


K 


R 


G 


I 


E 


K 


A 


V 


T? 


K 


V 


T E T 


L; and 




I 


T? 

—1 


D 


A 


V 


R 


N 


A 


K 


A 


A 


V 


]? 


E G. 





23. The polymer of claim 22 wherein said 
peptide repeating units are bonded together by 

30 oxidized cysteine residues present at the termini of 
said repeating units. 

24. The polymer of claim 20 wherein said 
mycobacter ium is M. leprae , and said peptide 
repeating units consist essentially of a sequence, 

35 written from left to right and in the direction from 
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amino- terminus to carboxy- terminus , represented by a 
formula selected from the group consisting of 

A V L E E P Y I L L V S S K V; and 
5 xafNSGMEPGVVEK. 

25. A polymer comprising a plurality of 
pentaoeotide repeating units, each of said 
pentapeotide repeating units consisting essentially 
10 of a sequence, written from left to right and in the 
direction from amino-terminus to carboxy-termmus , 
represented by the formula 

N N I G; or 
15 X G N Z G, 

wherein X is an amino acid residue selected 
from the group consisting of F , S r T, L , D and I; and 
Z is an amino acid residue selected from the group 
20 consisting of T, I, L , 3 and V. 

26. The polymer of claim 25 wherein said 
oentapeptide repeating units are bonded together by 
oxidized cysteine residues present at the termini of 

said repeating units. 
„ 27. A method for assaying for the presence 

of an infection of M. tuberculosis in a patient 
comprising the steps of: 

a) providing a solid phase support 
comprised of a 540 amino acid residue protein coded 

30 for by the M. tuberculosis genome or an 

immunologically active portion thereof as antigen 
affixed to a solid phase matrix? 

b) admixing a liquid sample from a patient 
with said solid phase support to form a solid-liquid 

35 phase admixture; 
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c) maintaining said admixture for a time 
period sufficient for antibodies to said 540 amino 
acid residue protein present in said sample to bind 
to said antigen of said solid support; 
5 d) separating the solid and liquid phases; and 

e) determining the presence of antibodies 
bound to the solid phase support. 

28. The method of claim 27 wherein said 
antigen is a recombinant protein. 
10 29. The method of claim 28 wherein said 

recombinant protein is a fusion protein that further 
includes a portion of the beta-galactos idase molecule 
bonded to the amino- terminus of said antigen. 

30. A diagnostic kit comprising a package 
15 that contains a solid support comprised of a purified 

540 amino acid residue protein encoded by the 

M . tuberculosis genome as antigen affixed to a solid 

phase matrix. 

31. The diagnostic kit of claim 30 further 
20 including a second package that contains a labeled 

reagent that reacts with human antibodies bound to 
said solid support. 

32. A method for ascertaining the presence 
of mycobacter ially- immune mammalian mononuclear cells 

25 in a body sample comprising the steps of 

(a) admixing and contacting mammalian 
mononuclear cells to be assayed in an aqueous medium 
with a stimulating amount of both antigen-presenting 
cells and a mycobacterial antigen to form a 

30 stimulatory cell culture, said mycobacterial antigen 
comprising 

(i) a 55KD cell wall protein-a of said 

mycobacteria, 

(ii) a recombinant fusion protein 
35 containing an immunologically active portion of said 
65KD protein, or 



-103- 

(iii) a peptide consisting essentially 
of a sequence of 5 to about 40 amino acid residues 
that correspond substantially to a sequence of the 
55KD cell wall protein-a of said mycobacteria and is 
capable of stimulating mycobacter ially-immune T cells; 

(b) maintaining said stimulatory cell 
culture for a time period sufficient for 
mycobacter ially-immune mononuclear cells present to 
be stimulated and to evidence stimulation; and 

(c) determining the presence of mononuclear 
cell stimulation. 

33. The method of claim 32 wherein said 
mycobacter ially- immune mononuclear cells are immune 
to either M. tuberculosis or M. bovis , and said 
mycobacterial antigen is a peptide antigen that has a 
sequence of about 10 to about 20 amino acid residues. 

34. The method of claim 33 wherein said 
peptide antigen has a sequence, written from right to 
left and in the direction from amino- terminus to 
carboxy-terminus , represented by a formula selected 
from the group consisting of 



A 


V 


L 


E 


D 


P 


Y 


I 


L 


L 


V 


S 


S 


K 


V; 




L 


L 


V 


S 


S 


K 


V 


S 


T 


V 


K 


D 


L 


L 


P; 




L 


L 


p 


L 


L 


E 


K 


V 


I 


G 


A 


G 


K 


P 


L; 




A 


I 


L 


T 


G 


G 


Q 


V 


I 


S 


E 


E 


V 


G 


L; 




I 


A 


F 


N 


S 


G 


L 


s 


P 


G 


V 


V 


A 


E 


K; 




A 


R 


R 


G 


L 


E 


R 


G 


L 


N 


A 


L 


A 


D 


A V 


K V; 


E 


K 


I 


G 


A 


E 


L 


V 


K 


E 


V 


A 


K 


K; 






G 


L 


K 


R 


G 


I 


E 


K 


A 


V 


E 


K 


V 


T 


E T 


L; and 


I 


TT 


D 


A 


V 


R 


N 


A 


K 


A 


A 


V 


E 


E 


G. 





35. The method of claim 34 wherein said 
method is carried out in vivo, said antigen- 
presenting cells are provided endogenously and said 
aqueous culture medium is endogenous blood or lymph. 
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36. The method of claim 34 wherein said 
method is carried out ir\ vi tro , and said stimulation 
is determined by an assay selected from the group 
consisting of (i) mononuclear cell proliferation, 

5 (ii) interleukin-2 secretion, ( i i i ) in ter f eron-gamma 

secretion, (iv) granulocyte macrophage-colony 
stimulating factor secretion and (v) cytotoxicity. 

37. The method of claim 34 wherein said 
peptide antigen is present as a polymer having 

10 repeating units comprised of said peptide antigen, 
and said peptide antigen repeating units are bonded 
together by oxidized cysteine residues at the termini 
thereof . 

38. A diagnostic assay kit comprising a 
15 container that includes a mycobacterial antigen 

present in an amount sufficient to carry out at least 
one assay, said mycobacterial antigen comprising 
(i) a peptide antigen, 
(ii) a polymer of said peptide antigen 
20 repeating units in which said peptide antigen 

consists essentially of a sequence of 5 to about 40 
amino acid residues that correspond substantially to 
a sequence of the 540 amino acid residue protein 
coded for by the DNA sequence of Figure 2 and is 
25 capable of stimulating mycobacter ially-immune T cells, 

( i i i ) a fusion protein that includes an 
immunologically active portion of said 65KD cell wall 
prote in-a . 

39. The diagnostic assay kit of claim 38 
30 wherein said mycobacterial antigen is a peptide 

antigen or a polymer of said peptide antigen 
repeating units, and said peptide antigen and said 
peptide antigen polymer repeating units have an amino 
acid residue sequence, written from right to left and 
35 from amino- terminus to car boxy- terminus , represented 
by a formula selected from the group consisting of 
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A 


V 


L 


E 


D 


P 


Y 


I 


L 


L 


V 


S 


S 


K V? 




L 


L 


V 


S 


S 


K 


V 


S 


T 


V 


K 


D 


L 


L P? 




L 


L 


P 


L 


L 


E 


K 


V 


I 


G 


A 


G 


K 


P L; 


5 


A 


I 


L 


T 


G 


G 


Q 


V 


I 


S 


E 


E 


V 


G L; 




I 


A 


F 


N 


S 


G 


L 


E 


P 


G 


V 


V 


A 


E K ; 




A 


R 


R 


G 


L 


E 


R 


G 


L 


N 


A 


L 


A 


D A V 




E 


K 


I 


G 


A 


E 


L 


V 


K 


E 


V 


A 


K 


K; 




G 


L 


K 


R 


G 


I 


E 


K 


A 


V 


E 


K 


V 


T E T 


10 


I 


E 


D 


A 


V 


R 


N 


A 


K 


A 


A 


V 


E 


E G. 



40. A vaccine against mycobacterial 
infection comprising a physiologically tolerable 
diluent and an immunizing amount of (i) a peptide 

15 containing a sequence of 5 to about 40 amino acid 
residues whose amino acid residue sequence 
corresponds substantially to a sequence of a 
mycobacterial 65KD cell wall protein-a and that is 
capable of stimulating T cells immune to said 

20 mycobacter ium that have a phenotype selected from the 
group consisting of T4 + and T8 + , or (ii) a 
polymer having said peptide antigen as repeating 
units . 

41. The vaccine of claim 40 wherein said 
25 mycobacter ium is M. tuberculosis and said peptide 

antigen or said peptide antigen repeating units 
contain about 10 to about 20 amino acid repeating 
units . 

42. The vaccine of claim 41 wherein said 
30 peptide has a sequence, written from left to right 

and in the direction from amino-terminus to 
carboxy- terminus f represented by the formula 



35 
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AVLEDPYILLVS SKV; and 
LLPLLEKIGAGKP L. 

43. Paratopic molecules that immunoreact 
5 with a peptide containing 5 to about 40 amino acid 
residues that corresponds substantially in sequence 
to the 540 amino residue protein coded for by the DNA 
sequence of Figure 2 and also to said 540 amino acid 
residue protein. 
10 44. The paratopic molecules of claim 43 

wherein said peptide has an amino acid residue 
sequence, written from right to left and in the 
direction from amino-terminus to carboxy-terminus , 
represented by a formula selected from the group 
15 consisting of 

MAKT I AYDEEARRGL; and 
KASVPGGGDMGGMDF. 



25 



30 



35 
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cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combtnation being obvious to a person skilled 
in the art. 

"4" document member of the same patent family 



IV. CERTIFICATION 


Date of the Actual Completion of the International Search 

13 MAY 1988 


Date of Mailing of this International Search Report 

2 8 JUN 1988 


International Searching Authority 

ISA/US 


Signature of Authorized Office* d j 

awnTTT. A. MOHAMED 



Form PCT/1SA/2 10<i 



»TMC)(Rev.11-a7) 



International Aoodcation No. 



PCT/US 88/00598 



III. DOCUMENTS CONSIDERED TO BE RELEVANT (CONTINUED FROM THE SECOND SHEET) 



Category * 



Citation of Document, w»th indication, wnere aoorooriaie. of the relevant oassages 



Relevant to Claim No »• 



US, A, 4,489,158, STRAUS . PUBLISHED , 18 

December 1984 (See columns 1-3 and 16). 

US, A, 4,575,484, STRAUS PUBLISHED, 11 
March 1986 (See columns 16-18). 



27-39 
27-39 



Form PCX ISA 210' (eura sneet) (Octooer 1981) 



i'J 



International Aooiication No. 



PCT/US 88/00598 



FURTHER INFORMATION CONTINUED FROM THE SECOND SHEET 



V.LJ OBSERVATIONS WHERE CERTAIN CLAIMS WERE FOUND UNSEARCHABLE 10 



This international search reoort has not been established in respect of certain claims under Article 17(2) (a) for the following reasons: 
1.| j Claim numbers because they relate to suoject matter not required to be searcned by this Authority, namely: 



2. | j Claim numbers because they relate to parts of the international application that do not comply with the prescribed require- 
ments to such an extent that no meaningful international search can be carried out l3 , specifically: 



VI. v j OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING " 



This International Searching Authority found multiple inventions in this International application as follows: 

I. Claims 1-9 are drawn to gene for Mycobacterium 

tuberculosis with plasmid vector and bacterial culture, 
classified in class 536, subclass 27. (SEE ATTACHMENT) . 
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II. Claims 10-15 and 40-42 are directed to vaccine and 
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